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PESTICIDAL AGENTS 

The present invention relates to materials, agents and 
5 compositions having pesticidal activity which derive from 
bacteria, and more particularly from Xenorhabdus species. 
The invention further relates to organisms and methods 
employing such compounds and compositions. 

There is an ongoing requirement for materials, agents, 
compositions and organisms having pesticidal activity, 
for instance for use in crop protection or insect- 
mediated disease control. Novel materials are required 
to overcome the problem of resistence to existing 
pesticides. Ideally such materials are cheap to produce, 
stable, have a high toxicity (either when used alone or 
in combination) and are effective when taken orally by 
the pest target. Thus any invention which provided 
materials, agents, compositions or organisms in which any 
of these properties was enhanced would represent a step 
forward in the art. 

Xenorhabdus spp. in nature are frequently symbiotically 
associated with a nematode host, and it is known that 
25 this association may be used to control pest activity. 
For instance, it is known that certain Xenorhabdus spp. 
alone are capable of killing an insect host when injected 
into the host's hemocoel. 

30 m addition, one extracellular insecticidal toxin from 
Photorhabdus luminescens has been isolated (this species 
was recently removed from the genus Xenorhabdus, and is 
closely related to the species therein). This toxin' is 
not effective when ingested, but is highly toxic when 

35 injected into certain insect larvae (see Parasites and 
Pathogens of Insects VoJ.2, Eds. Beckage, N. E. et 
al., Academic Press 1993). 



15 
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Also known are certain low-molecular weight heterocyclic 
compounds from P . luminescens and X.nematophilus which 
have antibiotic properties when applied intravenously or 
topically (see Rhodes, S.H. et al., PCT WO 84/01775). 

5 

Unfortunately none of these prior art materials have the 
ideal pesticide characteristics discussed above, and in 
particular, they do not have toxic activity when 
administered orally. 

10 

The present invention provides pesticidal agents and 
compositions from Xenorhabdus species, organisms which 
produce such compounds and compositions, and methods 
which employ these agents, compositions and organisms, 
15 that alleviate some of the problems with the prior art. 

According to one aspect of the present invention there is 
disclosed a method of killing or controlling insect pests 
comprising administering cells from Xenorhabdus species 
20 or pesticidal materials derived or obtainable therefrom, 
orally to the pests. 

A PCT application of CSIRO published as WO 95/00647 
discloses an apparently toxic protein from Xenorhabdus 
25 nematophilus; however no details of the protein's 

toxicity are given, and certainly there is no disclosure 
of its use as an oral insecticide. 

Thus the invention provides an insecticidal composition 
30 adapted for oral administration to an insect, which 

composition comprises a pesticidal material obtainable 
from a Xenorhabdus species, or a pesticidal fragment 
thereof, or a pesticidal variant or derivative of either 
of these. 

35 

The composition may in fact comprise cells of Xenorhabdus 
or alternatively supernatant taken from cultures of cells 
of Xenorhabdus species. However, the composition 
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preferably comprises toxins isolable from Xenorhabdus as 
illustrated hereinafter. Toxic activity has been 
associated with material encoded by the nucleotide 
sequence of Figure 2. Thus, the composition suitably 
5 comprises a pesticidal material which is encoded by all 
or part of the nucleotide sequence of Figure 2. 
Pesticidal fragments as well as variants or derivatives 
of such toxins may also be employed. 

10 The sequence of Figure 2 is of the order of 40kb in 

length. It is believed that this sequence may encode more 
than one protein, each of which may regulate or be 
insecticidal either alone or when presented together. It 
is a matter of routine to determine which parts are 

15 necessary or sufficient for insecticidal activity. 

As used herein the term ^variant" refers to toxins which 
have modified amino acid sequence but which share similar 
activity. Certain amino acids may be replaced with 

20 different amino acidB without altering the nature of the 
activity in a significant way. The replacement may be 
by way of ~ conservative substitution 1 1 where an amino 
acid is replaced with an amino acid of broadly similar 
properties, or there may be some non-conservative 

25 substitutions. In general however, the variants will be 
at least 60% homologous to the native toxin, suitably at 
least 70% homologous and more preferably at least 90% 
homologous . 

30 The term ~ derivative • ' relates to toxins which have been 
modified for example by chemical or biological methods. 

These toxins are novel, and they and the nucleic acids 
which encode them form a further aspect of the invention. 

35 

A preferred Xenorhabdus species is the bacteria 
X.nematophllus. Particular strainB of X.nematophilua? 
which are useful in the context of the invention are 
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ATTC 19061 strain, available from the National Collection 
of Industrial and Marine Bacteria, Aberdeen, Scotland 
(NCIMB ) . In addition, suitable strains include two novel 
strains of Xenorhabdus which were deposited at the NCIMB 
5 on 10 July 1997 and were designated with repository 
numbers NCIMB 40886 and NCIMB 40887. TheBe latter 
strains form a further aspect of the invention. 

All strains have common characteristics aB set out in the 
10 following Table 1. 





Table 


1 








oLIalllD 




Characteristics 


ATCC 19061 


NCIMB 40ao/ 




Gram strain 


negative 


negai. iv e 


neoative 


Shape/size 


rods up to 


rods up to 


rods up to 




4^im long 


4um long 


4^m long 


Motile 


Yes 


Yes 


Yes 


Bio lumines cent 


No 


No 


No 


Colour on NBTA* 


blue 


blue 


blue 


insecticidal on 








ingestion by 


yes 


ye6 


yes 


insects 








Production of 


yes 


yes 


yes 


Antibiotics 








Resistant to 








ampicillin 


ye 8 


yes 


yes 


(50[ig/ml) 








colony 


circular 


circular 


circular 


morphology/ 


convex 


convex 


convex 


colour 


cream 


cream 


cream 



15 *NBTA (Oxoid nutrient agar containing 0.0025% 
bromothymol blue and 0.004% tetrazolium chloride) 



Preferably the pest target is an insect, and more 
preferably it is of the order Lepidoptera, particularly 
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Pieris brassicae , Pieris rapae, or Plutella xylostella or 
the order Diptera, particularly Culex quinquefaciatus. 



In a preferred embodiment of the invention, cella from 
5 Xenorhabdus species or agents derived therefrom are used 
in conjunction with Bacillus thuringiensis as an oral 
pesticide. 

In further embodiments, rather than using Bacillus 
10 thuringiensis itself, pesticidal materials obtainable 
from B. thuringiensis (e.g. delta endotoxins or other 
isolates) are used in conjunction with Xenorhabdus 
species . 

15 The term 'obtainable from 1 is intended to embrace not 

only materials which have been isolated directly from the 
bacterium in question, but also those which have been 
subsequently cloned into and produced by other organisms. 



20 Thus the unexpected discovery that bacteria of the genus 
Xenorhabdus (and materials derived therefrom) have 
pesticidal activity when ingested, and that such bacteria 
and materials can be used advantageously in conjunction 
with B . thuringiensis (and toxins or materials derived 

25 therefrom), forms the basis of a further aspect of the 
present invention. The pesticidal activity of 
B. thuringiensis isolates alone have been well documented. 
However, synergistic pesticidal activity between such 
isolates and bacteria of the Xenorhabdus species (or 

30 materials derived therefrom) has not previously been 
demonstrated. 



In still further embodiments of the invention, culture 
supernatant taken from cultures of Xenorhabdus species, 
35 particularly X. nematophilus , is used in place of cells 
from Xenorhabdus species in the methods above. 
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All of these methods can be employed, inter aliftr in P eBt 
control . 

The invention also makes available pesticidal 
5 compositions comprising cells from Xenorhabdus species, 
preferably x.nematophllus , in combination with fl. 
thuringiensis. As with the methods above, a pesticidal 
toxin from B. thuringiensis (preferably a delta endotoxin) 
may be used as an alternative to B. thuringiensis in the 
10 compositions of the present invention 

Likewise, culture supernatant taken from cultures of 
Xenorhabdus species, preferably, X.nematophilus may be 
used in place of cells from Xenorhabdus species. 

15 

Such compositions can be employed, inter alia, for crop 
protection eg. by spraying crops, or for livestock 
protection. In addition, compositions of the invention 
may be used in vector control. 

20 

The invention further encompasses novel pesticidal agents 
which can be isolated from Xenorhabdus spp. Techniques 
for isolating such agents would be understood by the 
skilled person. 

25 

In particular, such techniques include the separation and 
identification of toxin proteins either at the protein 
level or at the DNA level. 

30 The applicants have cloned and partially sequenced a 

region of DNA from Xenorhabdus NCIMB 40887 which region 
codes for insecticidal activity and this is shown as 
Figure 2 (SEQ ID NO. 1) hereinafter. Thus in a preferred 
embodiment the invention also provides a toxin which is 

35 encoded by DNA of SEQ ID No. 1 or a variant or fragment 
thereof • 
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The invention also provides a recombinant DNA which 
encodes such a toxin. The recombinant DNA of the 
invention may comprise the sequence of Figure 2 or a 
variant or fragment thereof. Other DNA sequences may 
5 encode similar proteins as a result of the degeneracy of 
the genetic code. All such sequences are encompassed by 
the invention. 

The sequence provided herein is sufficient to allow 
10 probes to be produced which can be used to identify and 
subsequently to extract DNA of toxin genes. This DNA may 
then be cloned into vectors and host cells as is 
understood in the art. 

15 DNA which comprises or hybridises with the sequence of 
Figure 2 under stringent conditions forms a further 
aspect of the invention. 

The expression ^hybridises with' 1 means that the 
20 nucleotide sequence will anneal to all or part of the 
sequence of Figure 2 under stringent hybridisation 
conditions, for example those illustrated in ~ Molecular 
Cloning", A Laboratory Manual 1 ' by Sambrook, Fritsch and 
Maniatis, Cold Spring Habor Laboratory Press, Cold Spring 
25 Harbor, N.Y. 

The length of the sequence used in any particular 
analytical technique will depend upon the nature of the 
technique, the degree of complementarity of the sequence, 

30 the nature of the sequence and particularly the GC 
content of the probe or primer and the particular 
hybridisation conditions employed. Under high 
stringency , only sequences which are completely 
complementary will bind but under low stringency 

35 conditions, sequences which are 60% homologous to the 

target sequence, more suitably 80% homologous, will bind. 
Both high and low stringency conditions are encompassed 
by the term ""stringent condit ions' 1 used herein. 

SUBSTITUTE SHEET (RULE 26) 
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Suitable fragments of the DNA of Figure 2, i.e. those 
which encode pesticidal agents may be identified using 
standard techniques. For example, transposon 

5 mutagenesis techniques may be used, for example as 

described by H.S. Siefert et al., Proc. Natl. Acad. Sci. 
USA r <1986) 83, 735-739. Vectors such as the cosmid 
cHRIMI, can be mutated using a variety of transposons and 
then screened for loss of insectidal activity. In this 

10 way regions of DNA encoding proteins responsible for 
toxic activity can be identified. 

For example, the mini-transposon mTn3(HIS3) can be 
introduced into a toxic Xenorhabdus clone such as cHRIMI, 

15 hereinafter referred to as * clone 1 ' , by electroporating 
cHRIMI DNA into E.coll RDP146 (pLBlOl ) and mating this 
strain with E.coli RDP146(pOX38) , followed by E. coli 
NS2114Sm. The final strain will contain cHRIMlDNA with a 
single insertion of the transposon mTn3(HIS3). These 

20 colonies can be cultured and tested for insecticidal 
activity as described in Example 8 hereinafter. 
Restriction mapping or DNA sequencing can be used to 
identify the insertion point of mTn3(HIS3) and hence the 
regions of DNA involved in toxicity. Similar approached 

25 can be used with other transposons such as Tn5 and mTn£. 

Site directed mutagenesis of cHRIMI as outlined in 
^Molecular Cloning, A Laboratory Manual 11 by Maniatis, 
Fritsch and Sambrook, (1982) Cold Spring Harbor, can also 
30 be used to test the importance of specific regions of DNA 
for toxic activity. 

Alternatively, subcloning techniques can be used to 
identify regionB of the cloned DNA which code for 
35 insecticidal activity. In this method, specific smaller 
fragments of the DNA are subcloned and the activity 
determined. To do this, cosmid DNA can be cut with a 
suitable restriction enzyme and ligated into a compatible 
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restriction site on a plasmid vector, such as pUC19. 
The ligation mix can be transformed into E. coli and 
transformed clones selected using a selection marker such 
as antibiotic resistance, which is coded for on the 
5 plasmid vector . Details of these techniques are 

described for example in Maniatis et al, supra, (see 
p390-391) and Methods in Molecular Biology, by L.G. 
Davies, M.D. Dibner and J.F. Battey, Elsevier, (see p222- 
224) . 

10 

Individual colonies containing specific cloned fragments 
can be cultured and tested for activity as described in 
Example 8 hereinafter. Subclones with insecticidal 
activity can be further truncated using the same 
15 methodology to further identify regions of the DNA coding 
for activity. 

The invention also discloses an isolated pesticidal agent 
characterised in that the agent is obtainable from 

20 cultures of X. nematophilus or variants thereof, has oral 
pesticidal activity against Pieris brassicae, Pier is 
rapae and Plutella xylostella, is substantially heat 
stable to 55 °C, is proteinaceous, acts synergistically 
with B.thuringiensis cells as an oral pesticide and is 

25 substantially resistant to proteolysis by trypsin and 
proteinase K. 

By 'substantially heat stable to 55°C is meant that the 
agent retains some pesticidal activity when tested after 
30 heating the agent in suspension to 55 °C for 10 minutes, 
and preferably retains at least 50% of the untreated 
activity. 

By 'substantially resistant to proteolysis 1 is meant that 
35 the agent retains some pesticidal activity when exposed 
to proteases at 30°C for 2 hourB and preferably retains 
at least 50% of the untreated activity. 
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By 'acts synergistically' is meant that the activity of 
the combination of components is greater than one might 
expect from the use of the components individually. For 
example, when used in conjunction with B.thuringiensis 

5 cells as an oral pesticide, the concentration of B. 
thuringiensis cellular material necessary to give 50% 
mortality in a P.brassicae when used alone is reduced by 
at least 80% when it is used in combination the agent at 
a concentration sufficient to give 25% mortality when the 

10 agent is used alone* 

It has been found that the activity of the material is 
retained by 30 kDa cut-off filters but is only partly 
retained by 100 kDa filters. 

15 

Preferably the agent is still further characterised in 
that the pesticidal activity is lost through treatment at 
25°C with sodium dodecyl sulphate (SDS - 0.1% 60 rains) 
and acetone (50%, 60 mins). 

20 

Clearly the characterising properties of the isolated 
agent described above can be utilised to purify it from, 
or enrich its concentration in, Xenorhabdus species cells 
and culture medium supernatants . Methods of purifying 

25 proteins from heterogenous mixtures are well known in the 
art (eg. ammonium sulphate precipitation, proteolysis, 
ultrafiltration with known molecular weight cut-off 
filters, ion-exchange chromatography, gel filtration, 
etc.)- The oral pesticidal activity provides a 

30 convenient method of assaying the level of agent after 
each stage, or in each sample of eluent. Such 
methodology does not require inventive endeavour by those 
skilled in the art. 

35 The invention further discloses oral pesticidal 

compositions comprising one or more agents as described 
above. Such compositions preferably further comprise 
other pesticidal materials from non-Jfenorhabdus species. 
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These other materials may be chosen such as to have 
complementary properties to the agents described above , 
or act synergistically with it. 

5 Preferably the oral pesticidal composition comprises one 
or more pesticidal agents as described above in 
combination with B. thuringiensis (or with a toxin 
derived therefrom, preferably endotoxin). 

10 Recombinant DNA encoding said proteins also forms a 
further aspect of the invention. The DNA may be 
incorporated into an expression vector under the 
influence of suitable control elements such as promoters, 
enhancers, signal sequences etc. as is understood in the 

15 art. These expression vectors form a further aspect of 
the invention. They may be used to transform a host 
organism so as to ensure that the organism produces the 
toxin . 

20 The invention further makes available a host organism 
comprising a nucleotide sequence coding for a pesticial 
agent as described above. 

Methods of cloning the sequence for a characterised 
25 protein into a host organism are well known in the art. 
For instance the protein may be purified and sequenced: 
as activity is not required for sequencing, SDS gel 
electrophoresis followed by blotting of the gel may be 
used to purify the protein. The protein sequence can be 
30 used to generate a nucleotide probe which can itself be 
used to identify suitable genomic fragments from a 
Xenorhabdus gene library. These fragments can then be 
inserted via a suitable vector into a host organism which 
can express the protein. The use of such general 
35 methodology is routine and non-inventive to those skilled 
in the art. Such techniques may be applied to the 
production of Xenorhabdus toxins other than those encoded 
by the sequence of Figure 2. 

SUBSTITUTE SHEET (RULE 26) 
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It may be desirable to manipulate (eg. mutate) the agent 
by altering its gene sequence (and hence protein 
structure) such as to optimise its physical or 
5 toxicological properties. 

It may also be desirable for the host to be engineered or 
selected such that it also expresses other proteinaceous 
pesticidal materials (eg. delta- endotoxin from B. 
10 thuringiensis) . Equally it may be desirable to generate 
host organisms which express fusion proteins composed of 
the active portion of the agent plus these other toxicity 
enhancing materials. 

15 A host may be selected for the purposes of generating 

large quantities of pesticidal materials for purification 
e.g. by using B. thuringiensis transformed with the agent- 
coding gene. Preferably however the host is a plant, 
which would thereby gain improved pest-resistance. 

20 Suitable plant vectors, eg. the Ti plasmid from 

Agrobacterium tumefaciens, are well known in the art. 
Alternatively the host may be selected such as to be 
directly pathogenic to pests, eg. an insect baculovirus. 

25 The teaching and scope of the present invention embraces 
all of these host organisms plus the agents, mutated 
agents or agent-fusion materials which they express. 

Thus the invention makes available methods, compositions, 
30 agents and organisms having industrially applicable 
pesticidal activity, being particularly suited to 
improved crop protection or insect-mediated disease 
control . 

35 The methods, compositions and agents of the present 

invention will now be described, by way of illustration 
only, through reference to the following non-limiting 
examples and figures. Other embodiments falling within 
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the scope of the invention will occur to those skilled in 
the art in the light of these. 

FIGURE 

5 Figure 1 shows the variation with time of the growth of 
X. nematophilus ATCC 19061 and activity of cells and 
supematants against P. brassicae as described in Example 
3. 

10 Figure 2 shows the sequence of a major part of a cloned 
toxin gene from Xenorhabdus. 

Figure 3 shows a comparison of the restriction maps of 
cloned toxin genes from two strains of Xenorhabdus 
15 (clone 1 above and clone 3 below). 



EXAMPLES 

20 

Example 1 - Use of X. nematophilus cells as an oral 
insecticide 

CELL GROWTH: A subculture of X. nematophilus (ATCC 19061, 
Strain 9965 available from the National Collections of 
Industrial and Marine Bacteria, Aberdeen, Scotland) was 
used to inoculate 250 ml Erlenmeyer flasks each 
containing 50 ml of Luria Broth containing lOg tryptone, 
5g yeast extract and 5g NaCI per litre. Cultures were 
grown in the flasks at 27°C for 40hrs on a rotary shaker. 

PRODUCTION OF CELL SUSPENSION: Cultures were centrifuged 
at 5000 x a for 10 mins. The supematants were discarded 
and the cell pellets washed once and resuspended in an 
35 equal volume of phosphate buffered saline (8g NaCl, 1.44g 
Na 2 HP0 4 and 0.24g of KH 2 P0 4 per litre) at pH 7.4. 



25 



30 
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ACTIVITY OF CELL SUSPENSION TO INSECTS: The bioassays 
were as follows: P. brassicae: The larvae were allowed 
to feed on an artificial agar-based diet (as described by 
David and Gardiner (1965) London Nature, 207, 882-883) 

5 into which a series of dilutions of cell suspension had 
been incorporated. The bioassays were performed using a 
series of 5 doses with a minimum of 25 larvae per dose. 
Untreated and heat-treated (55°C for 10 minutes) cells 
were tested. Mortality was recorded after 2 and 4 days 

10 with the temperature maintained at 25 °C. 



LC50 cells/g diet 

Treatment 2 davs 4 <foyp 

Untreated 5.9 x 10 5 9.8 x 10 4 

15 Treated 55°C 7.1 x 10 5 1.4 x 10 5 



Aedes aegyptli The larva were exposed to a series of 5 
different dilutions of cell suspension in deionised 
water. The biosassays were performed using 2 doses per 
20 dilution of 50 ml cell suspension in 9.5cm plastic cups 
with 25 second instar larvae per dose. Untreated and 
heat-treated (55°C or 80°C for 10 minutes) cells were 
tested. Mortality was recorded after 2 days with the 
temperature maintained at 25 °C. 



25 



LC50 cells/ml 



Treatment 2 dftyg 

Untreated 5.1 x 10 

Treated 55°C 7.4 x 10 6 

30 Treated 80°C > 1° 



Culex quinq uefaclatus: The larvae were exposed to a 
single concentration cell suspension containing 4 xlO 
cells/ml. The biosassays were performed using 2 50 ml 
35 cell suspensions in 9.5 cm plastic cups with 25 second 
instar larvae per cup. Untreated and heat-treated (55°C 
or 80°C for 10 minutes) cell6 were tested. Mortality was 
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recorded after 2 days with the temperature maintained at 
25°C. 

% Mortality 

5 Treatment 2 days 

Untreated 100 

Treated 55°C 100 

Treated 80 °C 0 

10 Thus these results clearly show that cells from X. 

nematophilus are effective as an oral insecticide against 
a number of insect species (and are particularly potent 
against P. brassicae) > The insecticidal activity is not 
dependent on cell viability (i.e is largely unaffected by 

15 heating to 55°C which reduces cell viability by >99.99%) 
but is much reduced by heating to 80°C, which denatures 
most proteins* 

Example 2 - Use of X, nematophilus supernatant as an oral 
20 insecticide 

CELL GROWTH: Cultures were grown as in Example 1. 

PRODUCTION OF SUPERNATANT: Cultures were centrifuged 
25 twice at lOOOOg for 10 mins. The cell pellets were 
discarded. 

ACTIVITY OF SUPERNATANT TO INSECTS : The Bioassay was as 
follows : 

30 Activity against neonate P. brassicae and two day old 

Pieris rapae and Plutella xylostella larvae was measured 
as for P. brassicae in Example 1, but using a series of 
untreated dilutions of supernatant in place of of cell 
supensions and with mortality being recorded after 4 days 

35 only • 
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LC50 (jil supernatant/g diet) 
Insect species * days 

P. brassicae 22 
5 P. rapae 79 
P. xylosteiia I 35 

In addition, size-reducing activity (62% reduction in 7 
days) against Mamestra brassicae was detected in larvae 
10 fed on an artificial diet containing X. nematophilus 
supernatant (results not shown). 

Thus these results clearly show that the supernatant from 
X. nematophilus culture medium is effective as an oral 
15 insecticide against a number of insect species, and are 
particularly potent against P. brassicae. 

The heating of supernatant s to 55 °C for 10 minutes caused 
a partial loss of activity while B0°C caused complete 
loss of activity. Activity was also completely lost by 
treatment with SDS (0.1%w/v for 60 mins ) and Acetone (50% 
v/v for 60 mins) but was unaffected by Triton X-100 (0.1% 
60 mins), non-diet P40 (0.1% 60 mins), NaCl ( 1 M f or 60 
mins) or cold storage at 4°C or -20°C for 2 weeks. All 
25 of these properties are consistent with a proteinaceous 
agent . 

The general mode of action of X. nematophilus cells and 
supernatants i.e. reduction in larval size and death 
within 2 days at high dosages, and other properties, eg. 
temperature resistence, appear to be similar suggesting a 
single agent or type of agent may be responsible for the 
oral insecticide activity activities of both cells and 
supernatants . 



20 



30 



35 



Example 3 - Timescale for appearance of ingestable 
inBecticidal activity 
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CELL GROWTH: 1ml of an overnight culture of X. 
nematophllus was used to inoculate an Erlenmeyer flask* 
Cells were then cultured as in Example 1. Growth was 
estimated by measuring the optical density at 600 nm. 

5 

PRODUCTION OF CELL SUSPENSION AND SUPERNATANTS: These 
were produced as in Examples 1 and 2. 

ACTIVITY OF CELLS AND SUPERNATANTS AGAINST P. BRASSICAE: 
10 The cell suspension bioassay was carried out as in 

Example 1, but using a single dose of suspended cells 
equivalent to 50 yl of broth/g diet and measuring 
mortality after 2 days- The cell supernatant bioassay 
was carried out as in Example 2, but using a single dose 
15 equivalent to 50 pi supernatant/g diet (i.e. more than 
twice the LC50) and measuring mortality after 2 days. 

The results are shown in Fig. 1. Thus these results 
clearly show that cells taken from X. nematophllus 

20 culture medium are highly effective as an oral 

insecticide against P. brassicae after only 5 hours, and 
supernatants are highly effective after 20 hours. 
Although some slight cell lysis was observed in the early 
stages of growth, no significant cell lysis was observed 

25 after this point demonstrating that the supernatant 

activity may be due to an authentic extracellular agent 
(as opposed to one released only after cell breakdown). 

Example 4 - Synergy between X. nematophilus cells and 
30 B.thuringiensis powder preparations 

CELL GROWTH AND SUSPENSION: X. nematophllus cells were 
grown and suspended as in Example 1. B. thurlngiensis 
strain HD1 (from Bacillus Genetic Stock Centre, The Ohio 
35 State University, Columbus, Ohio 43210, USA) was 

cultured, harvested and formulated into a powder as 
described by Dulmage et al. (1970) J. Invertebrate 
Pathology 15, 15-20. 
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ACTIVITY OP X. NBMATOPHILUS CELLS AND B ♦ THURINGIENSIS 
POWDER AGAINST P. BRASS JCAE: The bioassays was carried 
out using X. nematophilus and B. thuringiensis in 

5 combination or using B» thuringiensis cell powder alone. 
Bioassays were carried out as in Example 1 but with 
various dilutions of B. thuringiensis powder in place of 
X. nematophilus. For the combination experiment, a 
constant dose of X. nematophilus cell suspension 

10 sufficient to give 25% mortaility was also added to the 
diet. Mortality was recorded after 2 days. 



Picas say 

15 B.t. alone 

B.t. plus X. nematophilus 

These results clearly demonstrate the synergism between 
X. nematophilus cells and B. thuringiensis powder when 
20 acting as an oral insecticide against P. brassicae. 

Example 5 - Synergy between of X. nematophilus 
supernatants and B. thuringiensis powder 

25 CELL GROWTH AND PRODUCTION OF SUPERNATANTS: X. 

nematophilus cells were grown and supernatants prepared 
as in Example 2. B. thuringiensis was grown and treated 
as in Example 4. 

ACTIVITY OF X. NEMATOPHILUS SUPERNATANTS AND Bt CELL 
POWDER AGAINST P. BRASSICAEx 

The bioassays were carried out using X. nematophilus 
supernatants and B. thuringiensis in combination or using 
B. thuringiensis powder alone. The Bioassay against 
neonate P. brassicae and two day old Pieris rapae and 
Plutella xylostella larvae were measured as in Example 2 
but with various dilutions of B. thuringiensis in place 
of X. nematophilus. For the combination experiment, a 



LC50 (jig Bt powder/g diet) 
2 days 
1*7 
0.09 



30 



35 
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constant dose of X. nematophilus supernatant sufficient 
to give 25% mortality was also added to the diet. 
Mortality was recorded after 4 days. 

5 

diet 

Insect species 
P. brassicae 
P. rapae 
10 p. xylostella 

These results clearly demonstrate the synergism between 
X. nematophilus supernatants and B.thuringiensis powder 
when acting as an oral insecticide against several insect 
15 species. The fact that both X. nematophilus cells and 
supernatants demonstrate this synergism strongly suggests 
that a single agent or type of agent is responsible for 
the demonstrated activities. 



LC 50 (pg Bt powder/g) 

Bt alone Bt plug KD 

1.4 0.12 

2.5 0.26 
7.2 0.63 



20 



Example 5 - Characterisation of insecticidal agent from 
X. nematophilus supernatant by proteolysis 



CELL GROWTH AND PRODUCTION OF SUPERNATANTS: X. 
nematophilus cells were grown and supernatants prepared 
25 as in Example 2. 



PROTEOLYSIS OF SUPERNATANT: Culture supernatant (50ml) 
was dialysed against 0.5 M NaCl (3x11) for 48 hours at 
4°C. The volume of the supernatant in the dialysis tube 
30 was reduced five-fold by covering with polyethylene 

glycol 8000 (Sigma chemicals). Samples were removed and 
treated with either trypsin (Sigma T8253 = 10 , 000 
units/mg) or proteinase K (Sigma P0390 = 10 units/mg) at 
a concentration of 0.1 mg protease/ml sample for 2 hours 

35 at 30°C. 

ACTIVITY OF PROTEASE TREATED SUPERNATANT AGAINST P. 
BRASSICAE : The boassay against neonate P. brassicae 
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larvae was carried out by spreading 25 pi of each 
'treatment* on the artificial agar-based diet referred to 
in Example 1 in a 4.5 cm diameter plastic pot. Four pots 
each containing 10 larvae were used for each treatment. 
5 Mortalities were recorded after 1 and 2 days. Controls 
using water only, trypsin (0.1 mg/ml) and proteinase K 
(0.1 mg/ml) were also tested in the same way. 





% Mortality 




Treatment 


1 day 


2 days 


Untreated supernatant 


60 


100 


Proteinase K treated supernatant 


45 


100 


Trypsin treated supernatant 


40 


100 


All controls (no supernatant) 


0 


0 



Example 6 

Entomocidal activity of other Xenorhabdus 

Using the methodology of Examples 1 and 2, four different 
20 xenorhabdus strains were tested against insect pests. 
The results obtained were as follows: 
I) Activity to Pieris brassicae 



Strain deposit 
no/code 


Cells 10 C /grm diet 
% mortality 


Supernatant LC50 
Hl/gram of diet 


NCIMB 40887 


100 


0.09 


0014 


100 


0.52 


0015 


80 


3.73 


NCIMB 40886 


100 


0.05 



25 It was found that entomocidal activity of cells and 

supernatant waB reduced by more than 99% when all four 
strains were heated at B0°C for 10 minutes. 
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II) Activity to mosquitoes (Aedes aegyptl) 
Bacteria added at the rate of 10 7 cells/ml of water 



Strain deposit 


Cells 10 6 /grm diet 


no/code 


% mortality 


NCIMB 40887 


0 


0014 


40 


0015 


45 


NCIMB 40886 


95 



5 Furthermore, all strains significantly reduced the growth 
of Heliothis virescens . 

Example i 

cloning of toxin genes from strains of XenQrhabdUS 
10 Total cellular DNA was isolated from NCIMB 40887 and ATCC 
19061 using a Quiagen genomic purification DNA kit. 
Cells were grown in L borth (lOg tryptone, 5g yeast 
extract and 5g NaCl per 1) at 28°C with shaking (ISOrpm) 
to an optical density of 1.5 A 60 o- Cultures were 
15 harvested by centrif ugation at 4000xg and resuspended in 
3.5mls of buffer Bl (50mM Tris/HCl, 0.05% Tween 20 , 0.5% 
Triton X-100, pH7.0) and incubated for 30 mins at 50°C. 
DNA was isolated from bacterial lysates using Quiagen 
100/G tips as per manufacturers instructions. The 
20 resulting purified DNA was stored at -20°C in TE buffer 
(lOmM Tris, ImM EDTA, pH 8.0). 

A representative DNA library was produced using total DNA 
of NCIMB 40887 and ATTC 19061 partially digested with the 

25 restriction enzyme Sau3a. Approximately 20|ig of DNA from 
each strain was incubated at 37°C with 0.25 units of the 
enzyme. At time intervals of 10, 20, 30, 45 and 60 
minutes, samples were withdrawn and heated at 65°C for 15 
minutes. To visualise the size of the DNA fragments, the 

30 samples were electrophoresed on 0.5% w/v agarose gels. 

SUBSTITUTE SHEET (RULE 26) 



WO 98/08388 



PCT/GB97/02284 



22 

The DNA samples which contained the highest proportion of 
30 to 50kb fragments were combined and treated with 4 
units of shrimp alkaline phosphatase (Boehringer) for 15 
minutes at 37°C, followed by heat treatment at 65°C to 
5 inactivate the phosphatase. 

The size selected DNA fragments were ligated into the 
BamHl site of the cosmid vector SuperCos ! (Stratagent) 
and packaged into the Escherichia coli strain XL Blue 1, 
10 using a Gigapack II packaging kit (Stratgene) in 
accordance with the manufacturers instructions. 

To select for cosmid clones with entomocidal activity, 
individual colonies selected on L agar plates containing 

15 25*ig/ml ampicillin, were grown in L broth (containing 
25|ig/ml ampicillin) overnight at 28°C. Broth cultures 
(50*11) were individually spread onto the surface of 
insect diet contained in 4.5cm diameter pots, as 
described in Example 5. To each container 10 neonate P. 

20 brassicae larvae were added. Larvae were examined after 
24 , 72 and 96 hours recording mortality and size of 
surviving larvae. A total of 220 clones of NCIMB 408B7 
were tested, of which two were found to cause reduction 
in larval growth and death within 72 hours. Of 370 

25 clones from ATTC 19061, one was found to cause larval 
death within 72 hours. 

Example 8 

Ar^ivity of clo ned t oxin genes tg f iertff frrafffficae 
30 The three active clones from Example 7 were grown in L 
broth, containing 25*ig/ml ampicillin, for 24 hours at 
28°C, on a rotary shaker at ISOrpm. The activity of the 
toxin clones to neonate larvae were performed by 
incorporation of whole broth cultures into insect diet, 
35 as described in Example 1. 
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Clone HQ 



Strain 



LC50 (ul b roth /p insect diet) 



1 



NCIMB 40887 



13.03 



2 



NCIMB 40887 



16.7 



3 



ATTC 19061 



108.7 



Control* 



No effect at lOOjil/g 



*XL1 Blue E. coll broth 

5 

When E. coli toxin clones were heated at 80°C for 10 
minutes and added to the diet at a rate of 100^1/g, no 
activity to larvae was detected. Highlighting the heat 
sensitivity of the toxin6. 

10 

Example 9 

Sequencing of the cloned toxin from NCIMB 40887 

Cosraid DNA of the entomocidal clone 1 above from NCIMB 

15 40887 was purified using the Wizard Plus SV DNA system 
(Promega) in accordance with the manufacturers 
instructions. A partial map of the cloned fragment was 
obtained using a range of restriction enzymes EcoRl, 
BamHl, ffindlll, Sail and Sacl as shown in Figure 3. DNA 

20 sequencing was intiatiated from pUC18 and pUC19 based 

sub-clones of the cosmid, using the enzymes EcoRl, BamHl, 
Hlndlll, EcoRV and PvuII. Sequence gaps were filled 
using a primer walking approach on purified cosmid DNA. 
Sequence reactions were performed using the ABI PRISM™ 

25 Dye Terminator Cycle Sequencing Ready Reaction Kit with 
AmmpliTaq DNA polymerase FS according to the 
manufacturers instructions. The samples were analysed on 
an ABI automated sequencer according to the manufacturers 
instructions. The major part of the DNA sequence for the 

30 cloned toxin fragment is shown in Figure 2. 



SUBSTITUTE SHEET (RULE 26) 



WO 98/08388 



24 



PCT/GB97/02284 



Example 10 

R e striction map »f tinned toxin from clone 3 

Cosmid DNA of the entomocidal clone 3 above was purified 

5 as described in Example 9. A restriction map of the 
cloned fragment was obtained using the restriction 
enzymes flamHl, Hindlll, Sail and Sacl and this is shown 
in Figure 3. When compared with the map from clone 1 
(Figure 3) it is clear that over the regions which 

10 overlap, the restriction maps are very similar. The 
only detectable difference between the two clones was a 
reduction in size of two Hindlll fragments in clone 3, 
corresponding to the 11.4kb and 7.2kb Hindlll fragments 
in clone 1 by approximately 2Kb and 200bp respectively. 

15 These results indicate the overall relatedness of the DNA 
region coding for toxicity in the two bacterial strains. 



Example 11 

Southern Blot Hybridi Ration Experiments 
20 A 10.3kb BamHl-Sail fragment of the DNA from clone 1 was 
used as a probe to hybidise to total Hindlll digested DNA 
of the Xenorhabdus strains ATCC 19061, NCIMB 40886 and 
NCIMB 40887. Hybridisation was performed with 20ng/ml of 
DIG labelled DNA probe at 65°C for 18 hours. Filters 
25 were washed prior to immunological detection twice for 5 
minutes with 2 x SSC (0.3M NaCl, 30mM sodium citrate, pH 
7.0)/0.1% (w/v) sodium dodecyl sulphate at room 
temperature, and twice for 15 minutes with 0.1 x SSC 
(15mM NaClm 1.5 mM sodium citrate, pH 7.0) plus 0.1% 

30 sodium dodecyl sulphate at 65°C. The probe was labelled 
and experiments performed in accordance with 
manufacturers instructions, using a non-radioactive DIG 
DNA labelling and detection kit (Boehringer ) . The probe 
hybridised to a Hindlll fragment of approximately 8kb in 

35 all three strains as well as an 11.4kb fragment in NCIMB 
40887 and an approximate 9kb fragment in both NCIMB 40886 
and ATCC 19061. These results show that strains NCIMB 
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40886 and ATCC 19061 contain DNA with close homology to 
the toxin gene of clone 1 above, confirming the 
similarity between the toxins produced by the three 
strains . 
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CLAIMS 

1. An insecticidal composition adapted for oral 
5 administration to an insect comprising a pesticidal 
material obtainable from a Xenorhabdus species, or a 
pesticidal fragment thereof, or a pesticidal variant or 
derivative of either of these. 

10 2. A composition according to claim 1 wherein the said 
pesticidal material comprises material encoded by the 
nucleotide sequence of Figure 2 or variant or fragment 
thereof, or a sequence which hybridises with said 
sequence . 

15 

3. A composition according to claim 1 or claim 2 which 
comprises cells of Xenorhabdus. 

4. A composition as claimed in any one of the 

20 preceding claims which comprises supernatant taken from 
cultures of cells of Xenorhabdus species. 

5. A composition according to any one of the preceding 
claims wherein the Xenorhabdus species is Xenorhabdus 

25 nematophilus. 

6. A composition according to any one of claims 1 to 4 
wherein the Xenorhabdus species is ATCC 19061, NCIMB 
40886 or NCIMB 40887. 

30 

7. A composition as claimed in any one of the preceding 
claims which comprises a further pesticidal material not 
obtainable from Xenorhabdus. 

35 8. A composition according to claim 7 wherein the said 
further pesticidal material comprises a material 
obtainable from B. tiiuringiensis. 
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9. A composition according to claim 8 which further 
comprises cells of B. thuringiensis. 

10. A composition according to claim 8 wherein the 

5 pesticidal materials obtainable from B. thuringiensis 
comprises the delta endotoxin. 

11. A composition according to any one of the preceding 
claims which further comprises an agriculturally 

10 acceptable carrier. 

12. A composition according to claim 10 wherein the 
carrier comprises items of insect diet. 

15 13. A method for killing or controlling insect pests, 
which method comprises administering to a pest or the 
environment thereof a composition according to any one of 
the preceding claims. 

20 14. A method as claimed in claim 12 wherein the pests 
are insects from the order Lepidoptera or Diptera. 

15. A microorganism comprising Xenorhabdus strain NCIMB 
40886. 

25 

16. A microorganism comprising Xenorhabdus strain NCIMB 
40887. 

17. A pesticidal agent which comprises a a toxin 
30 comprising a protein which is encoded by DNA which 

includes SEQ ID No. 1 or a variant or fragment thereof. 

18. An isolated pesticidal agent characterised in that 
it is obtainable from cultures of X. nematophilus or 

35 mutants thereof , has oral pesticidal activity against 
Pieris brassicae, Pleris rapae and Plutella xylostella, 
is substantially heat stable to 55°C, is proteinaceous, 
acts synergistically with B. thuringiensis cells as an 



SUBSTITUTE SHEET (RULE 26) 



6 



WO 98/08388 



PCT/GB97/D2284 



28 

oral pesticide , and is substantially resistant to 
proteolysis by trypsin and proteinase K. 

19. An isolated pesticidal agent as claimed in claim 18 

5 further characterised in that the pesticidal activity is 
substantially destroyed by treatment with sodium dodecyl 
sulphate or acetone or heating to 80 °C. 

20. An isolated pesticidal agent as claimed in claim 18 

10 or claim 19 further characterised in that the agent is an 
extracellular protein. 

21. A recombinant DNA which encodes a pesticidal agent 
according to any one of claims 17 to 20. 

15 

22. A recombinant DNA of claim 21 which comprises the 
sequence of Figure 2 or a variant or fragment thereof. 

23. A recombinant DNA which comprises or hybridises 
20 under stringent conditions with all or part of the 

sequence of Figure 2, and which encodes a pesticidal 
material . 

24. An expression vector comprising a recombinant DNA 
25 according to any one of claims 21 to 23. 

25. A host organism which has been transformed with an 
expression vector according to claim 24. 

30 26. A host organism as claimed in claim 25 which has been 
engineered or selected such that it also expresses other 
pesticidal proteinaceous toxicity enhancing materials 

27. A host organism comprising a nucleotide sequence 
35 coding for a fusion protein comprising a pesticidally 
active portion of an agent as claimed in any one of 
claims 17 to 20 in combination with other pesticidal 
proteinaceous toxicity enhancing materials. 
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28. A host organism as claimed in claim 27 wherein the 
pesticidal toxicity enhancing materials comprise delta- 
endotoxin from B. thuringiensis . 

5 

29. A host organism as claimed in any one of claims 25 to 
289 wherein the host is a plant. 

30. A host organism as claimed in any one of claims 25 to 
10 28 wherein the host is a virus pathogenic to insects. 

31. A fusion protein as expressed by a host as claimed in 
claim 27. 

15 32. An pesticidal composition comprising one or more 
agents as claimed in any one of claims 17 to 20. 



I 
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Fig.2. 
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TCCACAATTG CCGGAGAAAA TCAGTCGGGA ACTGCCGGTC ATTATTCGTC ACTTATTAAA 
CGAATTTGCC GACCAGAATA AGGCTAAAAA ACTGCTACAG 6C6CAA06C6 ACTCGAACGA 
AGCGTTAACG GTAAAGAGTC ATTCGGATCC GCTGTATCGC TTTTGTGGTT ATCTGGTGTC 
TGTCAATGAT ATGACCGGAA TGAAGATGGG CAATAAAAAC ATTAGCCCAC GAGCACCGAG 
ATTGTACTTG TA TCATG CCT ATCTCTCTTT TATGGAAGCG CACGGCTTTC AACGTCCGTT 
AACACTGACT AAGTTTGGTG AATCCATCCC CAAGATTATG CTGGAATACC GGAAGGAGTA 
TCGAAAAGTG CGAACCAAGA AAGGCTATTC CTATAACGTG GAATTATCGG AAGAGGCCGA 
AGAATGGCTA CCGTCAGTGC CTGAGTGTCG AGACTTTAAA TCACCTGTAT AAAACTTTGA 
GCTTTAAGTC TGCACTCCAT ACACAACTTA AAATATCTAA TTGTATTTAA AAGAAAATAA 
TAGATGTATA GTTATTTTTT AACTATACAT AAGCTCTACA TGCTCTTCAT TCGTGTAAAA 
AATGGGTGAA CAGGTGATAC AGTCAGTGAA TATCATATTA ATTACCGTAA ACCCAGATGT 
AGCAAGGCTT TCAGGGAATT GTGCAGAGGG TGCATAACTC AGAGGGTGAA AAAGATTTTC 
AGGGGGGC TT ATGGCAGGTA AACAAAATCA GAAGCAAATA CCGTGCACAA TCTGGTTTTT 
ATTTTTTGGT ACTACCTCAA ATTAA AATGA TGTAATCATC TGATTTTATT TAAGAATAGA 
AGTTAATCAC AATTTCATTG ATGGACTTTC ATTCACACTG GTATAGATAA ATAATTCTGT 
TATATCCTGT TTCATTACGC ATTCATCAGG AGTGCTGTTA CAGGAGACAA GAATGTCACA 
CATCATTTAC TTGTCGTTAA AGGGCAAGAA GCAGGGTTTA ATTTCAGCGG GTTG1TCAAC 
GCCTGAATCA ATTGGAAATC GCTATCAAAA AGGACGTGAA GATCAAATAC AGGTATTGAG 
CCTGAATCAT TCGATGAGCC GTGACCAGAA TGTTAATCAT CAACCCGTCA GTTTTGTGAA 
ACCCATTGAT AAATCCTCTC CCCTGTTTGC TGGATGCCAG TTTTGTGCAT TACAGGACAA 
GCCAGATGGG ACAACTGGAG TTCTTTTATG AAATCAAGCT GACCAGTGCC ACGATTGTGG 
ATATTTCCTA TAATTATCCG GCATTCAATC AATGATAA7G GTGCGATACC CCATGAAGTG 
GTGATGCTCG ATTATAAGTC CATTTCATGC AACCACATCG CCGCAGGACT TCGGGCTACA 
GCATACGCAA TTAGCCGGAA GTGAAGAAGC AAGCCGCTTT TATCTGGGGT CTCGAATGTT 
AAGCCACTTA AGAAGCCGCT GGTTGAAGAA ACCCCGGTAA AACCCGCTAA ACATCATGCC 
CGTTATCGTT GTGTGGATGA TGACGGCAAT CTTTTAACCG AACGCAAGTA TCGGGTTTGC 
CTGCCGGATG GTCAGATAAA AGAAGGAAAG AC7GA7AAAC AAGGTTACAC CCAATGGCAT 

AGGATTAATA CCATGCCAGC 
GAAAACCTGC TTTACGACTT 



CTTACGGATG ACAAAAATAA ACTTGAATTT CA' 
CTATACCGTT CAGACAAAAA TAGAATCCAA CG7ACCTGT: 
AACCATTTAT CGTAAGGATG CAAAAGGAAA T7TCCATATC TTGCTTGATG TTTTTCAGGA 



GAAACTACAG AGTAATTATG AAACACAACA GCA7ATCAC3 CAGGAAATAG ACGACGATCT 




CATGGCTCAA ATATATTTCC 
GAATTAACTT CCGGTAAAGC 



TGAA AGTACAGTTG AAACAAAACC 



TTCTGTGATT TATATTATGC AAATTATGC 
GGCACTGCAA ACCCATTTTA AGAAAATGTA 
CTGTTCGGAG AAAAAACGGG AAAATGCCTG 
TGTCAGCGAC GGGGATAATA CCGTTGACTT AAA7A7CAC7 ATTCCTGAAC GACCTTTTAT 
TGCCAAAGAA TATCCCATTG GTCACCCACA CGATCCATTT GAAAAAAGTA AAATTGAATC 
ATAAATACAG GACAGGTTAT CGAAAAGAAT TTATCCGGAT CAAAATGGAG CAAGTTTATG 
TCAGGGCGCG AGCACACTAT TTTAGCTGCG 7TTT7AAGA7 GATTATCTCT TAATGTTCAG 
TTTTAATAGT GTTTTTATCG AGTGAAATTT AATCGCACAG GCAATTCTTT AGACTTTTAT 
AGAAAACTAA AGAATTAAAG AACAAGATTG AC ATTTTAAG TTCAAATATT AATCAAAGTA 
TGCTCGCGCC CTGAGTTTAT GTGGCCCTGC CGCTTTTTTT TATTGCCTGC CAATAGATAG 
ACCAGATATT TATGAGCAAG CGGCACGAGA ATTATGGCAA TATGGCCGAA CTAAAATTGG 
TCAACTGGAA ATTAAGCCGG GTGAGGGTTG CCGACA7CCT AAAGGTACTT TTTATAATCA 
ATATGGTGAA AGAATATCTG GGTTAGATTG GCTGACATTG GCAAGCCTAA GAGATTCAGA 
AAATATGATG ATGAGGTTGA TGATGAAGTA GCTGGTATTA CAATGTGGGG AAAATTGACA 
GAATGGTTTG AAAAATCAGG GTATGAAAAA GTATTTAGTA ATGTCGGCTT ATCCCATTCT 
AATATAAATG ACATAGTAAC TCTTAGTGAT TACTATAACA AAGGATATCA TGTTGTTACT 
TTGATTTCAG CAGGAATGTT ATCAGATTTT GGTGACATAG AAACATCAGG AAAAAATCAT 
TGGATAGTTT GGGAAGGAGT AGTAGAAAAC TATGAGAAAG AAAATATCAC AAATAATTCA 
GATCTGAATC AATATGTAAA TTTAAATCTG TTTTCATGGG GTAAAGTGGA ACATCAAATT 
AAAAAAAACA AATCACTAGA TTATGTACTC AACCATATTT TTTGAGGGTT GG TITITA AA 
CCAATGAAAT AACATGAAAA AAATATTAAT 7ATTTTTATT TTTTTACTTT ATGGTTGTGG 
TAATCCAACG CCAAAAGTTT TACCAAAATC AGAGTTTCT7 CCTGATGCAG TGATAAATGA 
ACCATATCAG GCATCAATTA CCATCACAGG AGGTGCATTG AATGAAAAAA GCGTTTGGGT 
AAAAATTCAT CCTACTGGCT CAGGACTAAC ATGGAATCCA AAAGATAGTT CTTTCCTATA 
GGG7GGAAAA AAAGAAATAA GAAAAGATTA TCATCATATA AATATAACAG GTACCCCAAA 
GAAGACAGAA TTGATAAAAA TTGAAGTGGT AGGATTTACA TTGGGTACAA TGTACGCACG 
GAAAGAGTTC ACTATAAATT ATACTATAAA AGTAAGGGAA TAATTGTCAC TATCAGAATG 
GTGATTTAAT TCGCCATTTT TATACTTTTG TATACTCTCT CAACATAATC AGGATTCTTT 
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3541 CTTATTATTT TTCATGGTGC TAAAAACGTT TATTGCAAAA ATAAATTAAG TTAATCAGAT 

3601 AAATTATCTG CATTACTGTT ATAATCGATA ACACGATAAC CTGACTTTCT GCCTGTTCTT 

3661 ATGAACTCGA AGATAATCCT TTCTGAGCCT GAACGAATCA CATTGCAACC ACTCGCTTTG 

3721 AATCACCCAC ACCGGGACAT TCGTACGCGA GGAACGGGTT TACTCATGCT TGCCAGAGGG 

3781 AGCAAGCCGT CCCAGATCAC CGCTGAAATC GGATGCAGTC TCCGGGTTAT CTGTAATTGG 

3841 GTTCACATGT GGCACAGATA GCGGGATTAT TCGGCGGTCA TGCCGGAGGC CGGTATCTCG 

3901 CCATGACGCC TGACATGATT GCCACTGCGC TCGAAGCCGC CAGCGCAGAG TCCCTGAOGT 

3961 GCGTCGAAGC CAGGCAGGGT TTCCCTGCCT TGTACGCTTG AAACGCTGGC GAATACCCTG 

4021 AAAAAACAGG GGCTCCCCTA TAAACGCCCC CGCCTGTCGC TTAAAAAAAG CGCAATAAAA 

4081 CGGAGTTTGC TGAAAAATCC GCCTTGCTGA ATAAAATTAA GGCCGGAGCA CAGTCAGGAC 

4141 ATTACCGTCT GGTCTATTTT GAGTTCTGGG GGCGTTAAAT TACACGGATA ACACGCTGTT 

4201 TTACCAGACA ACGTCAGGCA GTATCACGCG AGATGACGTG ATTGA TTTTT TAGAGCCGGT 

4261 GGCCAGACAA GGGACAACCG CCTGACATTT TTAGTGTTGG ATAATGCGCG TATCCATCAC 

4321 GGGATAGAGG AAAAAATCAG AAATGGCGGG TGACGAGAAC ACAACCTGTT TTTATTCTAT 

4381 CTTCCCGCTT ACAGCCCAGA GCTGTATCTG ATTGAAATCG TCTGGAAACA GGCCAAATAC 

4441 GACTGGCGAC GTTTTATCAC CTGGACTCAG GATACAATGG AATATGAGGT AAATACTTTA 

4501 TTGAAAGGTT ATGGCGA CCA ATTTGCAATT AACTTTTCTT GAGTACTTAG TAAGAATAGA 

4561 GTCAGTCGAG GTTTTTTCAT TTCGGGTCGT GGGGATGATA CTGAAAATTT GTTTGTAATC 

4621 TCTGAAAATT GCTGTTTCTG TGGCTACGTC TGTCTTTTGG GATATTGTTT CCATCAAGTC 

4681 TGTCAACATA CTGTTAA GTT AGATGTTGAT AAAAGAGACT GAATTATAAT ACAAAACAAT 

4741 AAATCACTTG GACAATATTT TATTTCACAT GAGACATTAA GGTTGATTTT CCCAATCTGG 

4801 TCAGTTATAA CCGAATAAGG ATCTTGAAAA ATCATGGGAT CTTACTTTTA TCAAATGAAG 

4861 TTAACGTAAA AGTTGATAAA GAAAATTATT TAATTCTAAG TGCC GTTGGC ATAAATATTT 

4921 TGTGTTTTGT TAATGAATGA ATAACCAGGT AAGCTGGATT TTCATTTXTT AATTACTCGT 

4981 TACAATATGC TATTTATTTA TATAAAGAGT TTGTGCCCAT TTAACCAGTA AACAAATTTG 

5041 TTCAACCGTA ACTTAGCTTC ATCGACTTTT GGCCTCGCCT GGTCAGAATC TAGGGCCGTT 

5101 ATCCTATTTA TTTATGATAA ATAAAATTTA ATTATCTTTA ATAAGCTGAA TATGTGGATT 

5161 TGTGCTCAAT CTTGGATTCA AGTATGTATT CCTTTTGGTA CCCTGCTTTA TTTTAAGGCA 

5221 GATGAAGAGG ATGCCAACAT GACACAATAT CGATTACGAC TGTAACATTA AAGTCAGTTA 

5281 TAAATTTTAT GATTAAAATG AAATTTTAGT AGAAAATCGT ATTCTATTCC GCCATTTACA 

5341 ATAGCATCCT CTTTAATATC ATTAATCTCA GATAAAACAA ATAATTACAA TGTGAATAGA 

5401 ATAATGACTT ACAAAATAAG CACTAAATCT TCAGATGAAC TCTTAACTGA CAACACTATT 

5461 TTATAAAATA ATTGAGGTTA TTATGTATAG CACGGCTGTA TTACTCAATA AAATCAGTCC 

5521 CACTCGCGAC GGTCAGACGA TGACTCTTGC GGATCTGCAA TATTTATCCT TCAGTGAACT 

5581 GAGAAAAATC TTTGATGACC AGCTCAGTTG GGGAGAGGCT CGCCATCTCT ATCATGAAAC 

5641 TATAGAGCAG AAAAAAAATA ATCGCTTGCT GGAAGCGCGT ATTTTTACCC GTGCCAACCC 

5701 ACAATTATCC GGTGCTATCC GACTCGGTAT TGAACGAGAC AGCGTTTCAC GCAGTTATGA 

5761 TGAAATGTTT GGTGCCCGTT CTTCTTCCTT TGTGAAACCG GGTTCAGTGG CTTCCATGTT 

5821 TTCACCGGCT GGCTATCTCA CCGAATTGTA TCGTGAAGCG AAGGACTTAC ATTTTTCAAG 

5881 CTCTGCTTAT CATCTTGATA ATCGCCGTCC GGATCTGGCT GATCTGACTC TGAGCCAGAG 

5941 TAATATGGAT ACAGAAATTT CCACCCTGAC ACTGTCTAAC GAACTGTTGC TGGAGCTATT 

6001 ACCCGCAAGA CCGGAGGTGA TTCGGACGCA TTGATGGAGA GCCTGTCAAC TTACCGTCAG 

6061 GCCATTGATA CCCCTTACCA TCAGCCTTAC GAGACTATCC GTCAGGTCAT TATGACCCAT 

6121 GACAGTACAC TGTCAGCGCT GTCCCGTAAT CCTGAGGTGA TGGGGCAGGC GGAAGGGGCT 

6181 TCATTACTGG CGATTCTGGC CAATATTTCT CCAGAACTGT ATAACATTTT GACCGAAGAG 

6241 ATTACGGAAA AGAACGCTGA TGCTTTATTT GCGCAAAACT TCAGTGAAAA TATCACGCCC 

6301 GAAAATTTCG CGTCACAATC ATGGATAGCC AAGTATTATG GTCTTGAACT TTCTGAGGTG 

6361 CAAAAATACC TCGGGATGTT GCAGAATGGC TATTCTGACA GCACCTCTGC TTATGTGGAT 

6421 AATATCTCAA CX5GGTTTAGT GGTCAATAAT GAAAGTAAAC TCGAAGCTTA CAAAATAACA 

6481 CGTGTAAAAA CAGATGATTA TGATAAACAT GTAAATTACT TTGATCTGAT GTATGAAGGA 

6541 AATAATCAAT TCTTTATATG TGCTAATTTT AAGATATCGA GAGAATTTGG GGCGACTCTT 

6601 AGGAAAAACT CAGGGACAAG TGGCATTGTC GGCAGCCTTT CCGGTCCCCT GGTAGCCAAT 

6661 ACTAATTTCA AAAGCAATTA CTTAAGTAAC ATATCTGATA ATGAATACAG AAATGGCGTA 

6721 AAAATATATG CCTATCGCTA TACGTCTTCC ACCAGCGCCA CAAATCAGGG CGGCGGAATA 

6781 TTCACTTTTG AGTCTTATCC CCTGACTATA TTTGCGCTCA AACTGAATAA AGCCATTCGC 

6841 TTGTGCCTGA CTAGCGGGCT TTCACCGAAT GAACTGCAAA CTATCGTACG CAGTGACAAT 

6901 GCACAAGGCA TCATCAACGA CTCCGTTCTG ACCAAAGTTT TCTATACTCT GTTCTACAGT 

6961 CACCGTTATG CACTGAGCTT TGATGATGCA CAGGTACTGA ACGGATCGGT CATTAATCAA 

7021 TATGCCCGAC GATGACAGTG TCAGTCATTT TAACCGTCTC TTTAATACCC CGCCGCTGAA 

7081 AGGGAAAATC TTTGAAGCCG ACGGCAACAC GGTCAGCATT GATCCGGATG AAGAACAATC 

7141 TACCTTTGCC CGTTCAGCCC TGATGCGTGG TCTGGGGATC AACAGTGGTG AACTGTATCA 

7201 GTTAGGCAAA CTGGCGGGTG TATTGGACAC ACAAAATATC CTCACACTTT CTGTCCCTGT 

7261 TATATCTTCA CTGTATCGCC TCACGTTACT GGCCCGTGCC CATCAGCTGA CGGTTAATGA 

7321 ACTG7GTATG CTTTATGGTT TTTCGCCGTT CAATGGCAAA ACAACGGCTT CTTTGTCTTC 
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11221 CAAGATGTGG TATGTGCGTG CTTTGGAATT GCTGG6TGAT GAGCCGGAGG ATTACGGCAG 

11281 CCAACAGTGG GCCGCACCGT LTCTTT C O S T GGCGGGCAAC CACACTGTGC AAGCGGGCTA 

11341 TCAACAAGAC CTTACGGCGC TAGACAACGG AGAAGGTTGC ACTCAACCCC GCAACGCTAA 

11401 CTCGTTGGTG GTTTGGTCCT GCCGGAATAT AACCCGGAAT CAACOGATTA CTGGCAAACC 

11461 TGCGTTTGCG CCTGGTTAAC CTGCGCCATA ATCCTTCCAT GAOGGGCAAC CGTTATCGCT 

11521 GGCGAATTAC GOGAGCCTAC GATCCGAAAG CGCTGCTCAC CAGTATGGTA CAGCCTTCTC 

11581 AGGGCGGTAG TGCAGTGCTG CCCGGCACAT TGTCGTTATA CCGCTTCCCG GTGATGCTGG 

11641 AGCGGGCCCG CAATCTGGTA GCGCAATTAA CCCAGTTCGG CACCTCTCTG CTCAGTATGG 

11701 CAGAGCATGA TGATGCCGAT GAACTCACCA CGTTGCTACT ACAGCAGGGT ATGGAACTGG 

11761 CGACACAGAG CATCCGTATT CAGCAACGAA CTGTCGATGA AGTGGATGCT GATATTGCTG 

11821 TATTGGCAGA GAGCCGCCGC AGTGCACAAA A TCGT C TGGA AAAATACCAG CAGCTGTATG 

11881 ACGAGGATAT CAACCACGGA GAACAGCGTC CGATGTCACT GTTTGATGCG GCGGCAGGTC 

11941 AGTCTCTGGC CGGGCAGGCG CTCTCAGTAG CAGAAGGGGT GGCTGACTTA GTTCCAAACG 

12001 TGTTCGGTTT CGCTTGTGGC GGCAGTCGTT GGGGGGCAGC ACTGCGTGCT TCCGCCTCCG 

12061 TGATGTCGCT TTCTGCCACA GCTTCCCAAT ATTCCGCAGA CAAAATCAGC CGTTCGGAAG 

12121 CCTACCGCCG CCGCCGTCAG GAGTGGGAAA TTCAGCGTGA TAATGCTGAC GGTGAAGTCA 

12181 AACAAATGGA TGCCCAGCTG GAAAGCCTGA AAATACGCGG CGAAGCAGCA CAGATGCAGG 

12241 TGGAATATCA GGAGACCCAG CAGGCCCATA CTCAGGCTCA GTTAGAGCTG TTACAGCGTA 

12301 AATTCACAAA CAAAGCGCTT TACAGTTGGA TGCGCGGCAA GCTGAGTGCT ATCTATTACC 

12361 AGTTCTTTGA CCTGACCCAG TCCTTCTGCC TGATGGCACA GGAAGCGCTG CGCCGCGAGC 

12421 TGACCGACAA CGGTGTTACC TTTATCCGGG GTGGGGCCTG GAACGGTACG ACTGCGGGTT 

12481 TGATGGCGGG TGAAACGTTG CTGCTGAATC TGGCAGAAAT GGAAAAAGTC TGGCTGGAGC 

12541 GTGATGAGCG GGC ACTGG AA GTGACCCGTA CCGTCTCGTT GGCACAGTTC TATCAGGCCT 

12601 TATCATCAGA CAACTTTAAT CTGACCGAAA AACTCACGCA ATTCCTGCGT GAAGGGAAAG 

12661 GCAACGTAGG AGCTTCOGGC A ATGAA TTAA AACTCAGTAA CCGCCAGATA GAAGCCTCAG 

12721 TGCGATTGTC TGATTTGAAA ATTTTCAGCG ATACCCCGGA AAGCTTTGGC AATACCCGTC 

12781 AGTTGAAACA AGTGAGTGTC ACCTTGCCGG CGCTGGTTGG TCCGTATGAA GATATCCGGG 

12841 CGGTGCTGAA TTACGGCGGC AGCATCGTCA TGCCACGCGG TTGCAGTGCT ATTGCTCTCT 

12901 CCCACGGCGT GAATGACAGT GGTCAATTTA TGCTGGATTT CAACGATTCC CGTTATCTGC 

12961 CGTTTGAAGG TATTTCCGTG AATGACAGCG GTAGCCTGAC GTTGAGTTTC CCGGATGCGA 

13021 CTGATCGACA GAAAGCGCTG CTGGAGAGCC TGAGCGATAT CATTCTGCAT ATCCGCTATA 

13081 CCATTCGTTC TTAATTAAAA CATTGTGATA GGCAGGCTCC TGAGGGAGCC TGTTTAAGGA 

13141 GTTTTTATGC AGGGTTCAAC ACCTTTGAAA CTTGAAATAC CGTCATTGCC CTCTGGGGGC 

13201 GGATCACTAA AAGGAATGGG AGAAGCACTC AATGCCGTCG GAGCGGAAGG GGAGCGTCAT 

13261 TTTCACTGCC CTTGCCGATC TCTGTCCGGC GTGGTCTGGT GCCGGTGCTA TCACTGAATT 

13321 ACAGCAGTAC TGCTGGCAAT GGGTCATTCG GGATGGGGTG GCAATGTGGG GTTGGTTTTA 

13381 TCAGCCTGCG TACCGCCAAG GGCGTTCCGC ACTATACGGG ACAAGATGAG TATCTCGGGC 

13441 CGGATGGGGA AGTGTTGAGT ATTGTGCCGG ACAGCCAAGG GCAACCAGAG CAACGCACCG 

13501 CAACCTCACT GTTGGGGACG GTTCTGACAC AGCCGCCTAC TGTTACCCGC TATCAGTCCC 

13561 GCGTGGCAGA AAAAATCGTT CGTTTAGAAC ACTGGCAGCC ACAGCAGAGA CGTGAGGAAG 

13621 AGACGTCTTT TTGGGTACTT TTTACTGCGG ATGGTTTAGT GCACCTATTC GGTAAGCATC 

13681 ATCATGCACG TATTGCTGAC CCGCAGGATG AAACCAGAAT TGCCCGCTGG CTGATGGAGG 

13741 AAACCGTCAC GCATACCGGG GAACATATTT ACTATCACTA TCGGGCAGAA GACGATCTTG 

13801 ACTGTGATGA GCATGAACTT GCTCAGCATT CAG G T G TTAC GGCCCACCGT TATCCTGGCA 

13861 AGTCCACTAT GGCAATACTC AGCCGGAAAC CGCTTTTTTC GOGGTAAAAT CAGGTATCCC 

13921 TGTTGATAAT GACTGGTTGT TTCATCTGGT ATTTGATTAC GGTGAGCGCT TATCTTCGCT 

139B1 GAACTCCGTA CCCGAATTCA ATGTGTCAGA AAACAATGTG TCTGAAAACA ATGTGTCTGA 

14041 AAAATGGOGT TGTCGTCCGG ACAGTTTCTC CCGCTATGAA TATGGGTTTG AAATTCGAAC 

14101 CCGTCGCTTG TGTCGCCAAG TTCTG ATGTT TCATCAGCTG AAAGCGCTGG CAGGGGAAAA 

14161 GGTTGCAGAA GAAACACCGG CGCTGGTTTC CCGTCTTATT CTGGATTATG ACCTGAACAA 

14221 CAAGGTTTCC TTGCTGCAAA CGGCCCGCAG ACTGGCCCAT GAAACGGACG GTACGCCAGT 

14281 GATGATGTCC CCGCTGGAAA TGGATTATCA ACGTGTTAAT CATGGCGTGA ATCTGAACTG 

14341 GCAGTCCATG CCGCAGTTAG AAAAAATGAA CACGTTGCAG CCATACCAAT TGGTTGATTT 

14401 ATATGGAGAA GGAATTTCCG GCGTTACTTT ATCAGGATAC TCAGAAAGCC TGGTGGTACC 

14461 GTGCTCCGGT ACGGGATATC ACTGCCGAAG GAACGAATGC GGTTACCTAT GAGGAGGCGA 

14521 AACCACTGCC ACATATTCCG GCACAACAGG AAAGCGCGAT GTTGTT GG AC ATCAATGGTG 

14581 ACGGGCGTCT GGATTGGGTG ATTAOGGCAT CAGGGTTACG GGGCTACCAC ACCATGTCAC 

14641 CGGAAGGTGA ATGGACACCC TTTATTCCAT TATCCGCTGT GCCAATGGAA TATTTCCATC 

14701 CGCAGGCAAA ACTGGCTGAT ATTGATGGGG CTGGGCTGCC TGACTTAGCG CTTATCGGGC 

14761 CAAATAGTGT ACGTGTCTGG TCAAATAATC CGGCAGGATG GGATCGCGCT CAGGATGTTA 

14821 TTCATTTGTC AAATAAGCCA CTGCCGGTTC CCGGCAAAAA TAAGCGTCAT CTTGTCGCAT 

14881 TCAGTGATAT GACAGGCTCC GGGCAATCAC ATCTGGTGGA AGTTACGGCA AATAGCGTGC 

14941 GCTACTGGCC GAACCTGGGG CATGGAAAAT TTGGTGAGCC TCTGATGATA ACAGGCTTCC 

15001 AAATTACGGG GAAACGTTTA ACCCCCACAG ACTGTATATG GTAGACCTAA ATGGCTCAGG 
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15061 CACCACCCGA TTTTATTTAT GCCCGCAATA CTTACCTTCA ACTCTATGCC AATGAAAGCG 

15121 GCAATCATTC TGCTGAACCT CAGCGTATTG ATCTGCCGGA TGGGGTACGT TTTGATQATA 

15181 CTTGTCGGTT ACAAATAGCG GATACACAAG GATTAGGGAC TGCCAGCATT ATTTTGACGA 

15241 TCCCCCATAT GAAGGTGCAG CACTGGCGAT TGGATATGAC CATATTCAAG CCTTGGCTGC 

15301 TGAATGCCGT CAATAACAAT ATGGGAACAG AAACCACGCT GTATTATCGC AGCTCTGCCC 

15361 AGTTCTGGCT GGATGAGAAA TTACAGGCTT CTGAATCCGG GATGACGGTG GTCAGCTACT 

15421 TACCGTTCCC GGTGCATGTG TTGTGGCGCA CGGAAGTGCT GGATGAAATT TCCGGTAACC 

15481 GATTGACCAG CCATTATCAT TACTCACATG GTGCCTGGGA TGGTCTGGAA CGGGAGTTTC 

15541 GTGGTTTTGG GCGGGTGACG CAAACTGATA TTGATTCACG GGCGAGTGCG ACACAGGGGA 

15601 CACATGCTGA ACCACCGGCA CCTTCGCGCA CGGTTAATTG GTACGGCACT GG CGTACGGG 

15661 AAGTCGATAT TCTTCTGCCC ACGGAATATT GGCAGGGGGA TCAACAGGCA TTTCCCCATT 

15721 TTACCCCACG CTTTACCCGT TATGACGAAA AATCCGGTGG TGATATGACG GTCACGCCGA 

15781 GCGAACAGGA AGAATACTGG TTACATCGAG CCTTAAAAGG ACAACGTTTA CGCAGTGAGC 

15841 TGTATGGGGA TGATGATTCT ATACTGGCCG GTACGCCTTA TTCAGTGGAT GAATCCCGCA 

15901 CCCAAGTACG TTTGTTACCG GTGATGGTAT CGGACGTGCC TGCGGTACTG GTTTCGGTGG 

15961 CCGAATCCCG CCAATACCGA TATGAAGGGG TTGTTACCGA TTCCACAGTG CAGCCAAAAG 

16021 ATTGTCCTTA AATATGATGC GTTAGGATTT CCGCAGGACA ATCTTCAGAT TGCCTATTCG 

16081 AGACGTCCAC AGCCTGAGTT CTCGCCTTAT CCGGATACCC TGCCCGAAAC ACTTTTCACC 

16141 AGCAGTTTCG ACGAACAGCA GATGTTCCTT CGTCTGACAC GCCAGCGTTT TTCTTATCAC 

16201 CATCTGAATC ATGATGATAA TACGTGGATC ACAGGGCTTA TGGATACCTC ACGCAGTGAC 

16261 GCACGTATTT ATCAAGCCGA TAAAGTGCCG GACGGTGGAT TTTCCCTTGA ATGGTTTTCT 

16321 GCCACAGGTG CAGGAGCATT GTTGTTGCCT GATGCCGCAG CCGATTATCT GGGACATCAG 

16381 CGTGTAGCAT ATACCGGTCC AGAAGAGCAA CCCGCTATTC CTCCGCTGGT GGCATACATT 

16441 GAAACCGCAG AGTTTGATGA ACGATCGTTG GCGGCTTXTG AGGAGGTGAT GGATGAGCAG 

16501 GAGCTGACAA AACAGCTGAA TGATGCGGGC TGGAATACGG CAAAAGTGCC GTTCAGTGAA 

16561 AAGACAGATT TCCATGTCTG GGTGGGACAA AAGGAATTTA CAGAATATGC CGGTGCAGAC 

16621 GGATTCTATC GGCCATTGGT GCAACGGGAA ACCAAGCTTA CAGGTCAAAC GACAGTGACG 

16681 TGGGATAGCC ATTACTGTGT TATCACCGCA ACAGAGGATG CGGCTGGCCT GCGTATGCAA 

16741 GCGCATTACG ATTATCGATT TATGGTTGCG GATAACACCA CAGATATCAA TGATAACTAT 

16801 CACACCGTGA CGTTTGATGC ACTGGGGACG GTAACCAGCT TCCGTTTCTG GGGGACTGAA 

16861 AACGGTGAAA AACAAGGATA TACCCCTGCG GAAAATGAAA CTGTCCCCTT TATTGTCCCC 

16921 ACAACGGTGG ATGATGCTCT GGCATTGAAA CCCGGCA7AC CTGTTGCAGG GCTGATGGTT 

16981 TATGCCCCTC TGAGCTGGAT GGTTCAGGCC AGrrTTTrTA ATGATGGGGA GCTTTATGGA 

17041 GAGCTGAAAC CGGCTGGGAT CATCACTGAA GATGG7TATC TCCTGTCGCT TGCTTTTCGC 

17101 CGCTGGCATC AAAATAACCC TGCCGCTGCC A7SCCAAAGC AAGTCAATTC ACAGAACCCA 

17161 CCCCATGTAC TGAGTGTGAT CACCGACCGC 7A7GA7GCCG ATCCGGAACA ACAATTACGT 

17221 CAAACGTTTA CGTTTAGTGA TGGTTTTGGG CG AAA CTCTTA CAAACAGCCG TACGCCATGA 

17281 AAGTGGTGAA GCCTGGGTAC CTGATGAGTA TGGAGCCAAT GTGGCTGAAA ATCAAGGCGC 

17341 CCCTGAAACG GGCGATTACA AATTTCCCGT TGGGC AATTT CCCGGACGTA CAGAATATTA 

17401 ACGGGAAAAG GCAAAGCCCC TGCGTTACG7 7TCAAACCGT ATTCCTGAAA TAATTTGGGC 

17461 AACTATGTCA AGTTGACCAA AAAATGCCCG GCAGGATA7G TATGCCGATA CCCATTACTA 

17521 TGATCCGTTG GGGCGTGAAT ATCAGGTTAT CACGCCAAAG GCGGGTTGCG TCGATCCTTA 

17581 TTCACTCCCT GGTTTGTGGT GAATCAAGTT GAAAATGACA CTCCCGGTGA ATGACAGCAT 

17641 AAAGCTCAGT GATGCCTGTT CACTGAACAG ACATCACTCC ATTTAGGAAT GAATCATGAA 

17701 GAATTTCGTT CACAGCAATA CGCCATCCGT CACCGTACTG GACAACCGTG GTCAGACAGT 

17761 ACGCGAAATA GCCTGGTATC GGCACCCCGA TACACCTCAG GTAACCGATG AACGCATCAC 

17821 CGGTTATCAA TATGATGCTC AAGGATCTCT GACTCAGAGT ATTGATCCGC GATTTTATGA 

17881 ACGCCAGCAG ACAGCGAGTG ACAAGAACGC CATTACACCC AATCTTATTC TCTTGTCATC 

17941 ACTCAGTAAG AAGGCATTGC GTACGCAAAG TGTGGATGCC GGAACCCGTG TCGCCCTGCA 

18001 TGATGTTGCC GGGCGTCCCG TTTTAGCTGT CAGCGCCAAT GGCGTTAGCC GAACGTTTCA 

18061 GTATGAAAGT GATAACCTTC CGGGACGATT GCTAACGATT ACCGAGCAGG TAAAAGGAGA 

18121 GAACGCCTGT ATCACGGAGC GATTGATTTG GTCAGGAAAT ACGCCGGCAG AAAAAGGCAA 

18181 TAATTTGGCC GGCCAGTGCG TGGTCCATTA TGATCCCACC GGAATGAATC AAACCAACAG 

18241 CATATTGTTA ACCAGCATAC CCTTGTCCAT CACACAGCAA TTAGTGAAAG ATGACAGCGA 

18301 AGCCGATTGG CACGGTATGG ATGAATTTGG CTGGAAAAAC GCGCTGGCGC CGGAAAGCTT 

18361 CACTTCTGTC AGCACAACGG ATGCTACCGG CACGGTATTA ACGAGTACAG ATGCTGCCGG 

18421 AAACAAGCAA CGTATCGCCT ATGATGTGGC CGGTCTGCTT CAAGGCAGTT GGTTGGCGCT 

18481 GAAGGGGAAA CAAGAACAAG TTATCGTGAA ATCCCTGACC TATTCGGCTG CCAGCCAGAA 

18541 GCTACGGGAG GAACATGGTA ACGGGATAGT GACTACATAT ACCTATGAAC CCGAGACGCA 

18601 ACGAGTTATT GGCATAAAAA CAGAACGTCC TTCCGGTCAT GCCGCTGGGG AGAAAATTTT 

18661 ACAAAACCTG CGTTATGAAT ATGATCCTGT CGGAAATGTG CTGAAATCAA CTAATGATGC 

18721 TGAAATTACC CGCTTTTGGC GCAACCAGAA AATTGTACCG GAAAATACTT ACACCTATGA 

18781 CAGCCTGTAC CAGCTGGTTT CCGTCACTGG GCGTGAAATG GCGAATATTG GCCGACAAAA 

18841 AAACCAGTTA CCCATCCCCG CTCTGATTGA TAACAATACT TATACGAATT ACTCTCGCAC 
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18901 TTACGACTAT GATCGTGGGG GAATCTGACC AGAATCGCAT AATTCACGAT CACCGGTAAT 

18961 AACTATACAA CGAACATGAC CGTTTCAGAT CACAGCAACC GGGCTGTACT GGAAGAGCTG 

19021 GCGCAAGATC CCACTCAGGT GGATATGTTG TTCACCCCCG GCGGGCATCA GACCCGGCTT 

19081 GTTCCCGGTC AGGATCTTTT CTGGACACCC CGTGACGAAT TGCAACAAGT GATATTGGTC 

19141 AATAGGGAAA ATACGACGCC TGATCAGGAA TTCTACCGTT ATGATGCAGA CAGTCAGCGT 

19201 GTCATTAAGA CTCATATTCA GAAGACAGGT AACAGTGAGC AAATACAGCG AACATTATAT 

19261 TTGCCAGAGC TGGAATGGCG CACGACATAT AGCGGCAATA CATTAAAAGA GTTTTTGCAG 

19321 GTCATCACTG TCGGTGAAGC GGGTCAGGCA CAAGTGCGGG TGCTGCATTG GGAAACAGGC 

19381 AAACCGGCGG ATATCAGCAA TGATCAGCTG CGCTACAGTT ATGGCAACCT GATTGGCAGT 

19441 AGCGGGCTGG AATTGGGACA GTGACGGGCA GATCATTAGT CAGGAAGAAT ATTACCCCTA 

19501 TGGGGGAACC GCCGTGTGGG CACCCGAAAT CAGTCAGAAG CTGATTACAC AAGCCGGCGT 

19561 TATTCTGGCA AAGAGCGGGA TGCAACAGGG TTGTATTACT ACGGCTATCG TTATTATCAA 

19621 TCGTGGACAG GGCGATGGTT GAGTGTAGAT CCTCCCGGTG AGGCCGATGG TCTCAATTTG 

19681 TTCCGAATGT GCAGGAATAA CCCCATCGTT TTTTCTGATT CTGATGGTCG TTTCCCCGGT 

19741 CAGGGTGTCC TTGCCTGGAT AGGGAAAAAA GCGTATCGAA AGGCAGTCAA CATCACGACA 

19801 GAACACCTGC TTGAACAAGG CGCTTCCTTT GATACGTTCT TGAAATTAAA CCGAGGATTG 

19B61 CGAACGTTTG TTTTGGGTGT GGGGGTACAA GTCTGGGGGT GAAGCGGCCA CGATTGCAGG 

19921 AGCGTCGCCT TGGGGGATCG TCGGGGCTGC CATTGGTGGT TTTGTCTCCG GGGCGGTGAT 

19981 GGGGTTTTTC GCGAACAACA TCTCAGAAAA AATTGGGGAA GTTTTAAGTT ATCTGACGCG 

20041 TAAACGTTCT GCTCCTGTTC AGGTAGGCGC TTTTGTTGTC ACATCGCTTG TGACGTCTGC 

20101 ACTATTTAAC AGCTCTTCGA CAGGTACCGC CATTTCCGCA GCAACAGCGG TCACCGTTGG 

20161 AGGATTAATG GCTTTAGCCG GAGAACATAA CACGGGCATG GCTATCAGTA TTGCCACACC 

20222 CGCCGGACAA AGTACGCTGG ATACGCTCAG GCCCGGTAAT GTCAGCGCGC CAGAGCGGTT 

202 Bi AGGGCACTAT CAGGCGCAAT TATTGGCGGC ATATTACTTG GCCGCCATCA GGGAAGTTCT 

20341 GAGCTGGGTG AACGGGCAGC GATTGGTGCT ATGTATGGTG CTCGATGGGG AAGGATCATT 

20401 GGTAATCTAT GGGATGGCCC TTATCGGTTT ATCGGCAGGT TACTGCTCAG AAGAGGCATT 

20461 AGCTCTGCCA TTTCCCACGC TGTCAGTTCC AGGAGCTGGT TTGGCCGAAT GATAGGAGAA 

20521 AGTGTCGGGA GAAATATTTC TGAAGTATTA TTACCTTATA GCCGTACACC CGGTGAATGG 

20581 GTTGGTGCAG CCATTGGCGG GACAGCCGCG GC CGCT CATC ATG CCGTT GG AGGGGAAGTT 

20641 GCCAATGCCG CTAGCCGGGT TACCTGGAGC GGCTTTAAGC GGGCTTTTAA TAACTTCTTC 

20701 TTTAACGCCT CTGCACGTCA TAATGAATCC GAAGCATAAC AATCATGTTC ATTCCCACTT 

207 61 TGTCATGGAT GACAAGGTGG GTTTTTCGGA TGTGTGGACA GAGACCCGTA CAGGGTCTCT 

20821 GTCCAGTTAA TTTTTGGATC AAGAACGAA7 GGTGTAACGG ATATGCAAAA TGATATCGCT 

20861 CAGGCTGAGC AATAAGCTTT TCTGTTTACC AC7GATACCG GGAAAACTGA GGGTTAATGT 

20941 GCCTGTATCG GCCACAGGAA GCCCTTCAAA TGGCAGGTAC TTAGCATCAT TGAAATCCAT 

21001 CTGGAATTGA CCACTGTCAT TCATGCCATG TGAGATCACA ATCGCTTTGC AGCCACGTGG 

21061 CATCATTGTA CTGCCGCCAT AACTCAGTAT TGCCCGGACA TCCTGATAAG GCCCTAAAAG 

21121 GGCAGGTAAC GTCACACTGA TTTGTTTGAT ACGGCGTGTA TTACCTAAAC CGTCAGGATA 

21181 ATCGGTAGCA ATATTCAGAT CCGATAATTT GAGGCTGGCT TGCAGTTGTG TCCCTTCGAC 

21241 GTTCAAACCG TTAAGCGTTG TGCCTGCACT GCCTTCACCT GCATTGACTA ACTCAGTCAC 

21301 TTTATCTTTT AAAATGAAAC TATTTTCTGT CAGACCAGCA TACACTTCAG CCAGAGAAAC 

21361 GGTTCTGGTG ACCTCCAGTG CCCGTTCATC TTTTTCCAAA TAGCTTTTTT CCATCTGTGC 

21421 TAAATTCAGC ATCAGGGTTT CACCCGCTAA TAAACCCGCA TAAGTCCCAT GCCAAGCACC 

214 81 TGGTTTAATA AAGTGTGCTG CCGCATTATT CAATTCATAC TGATAAGTTT GCTCTGCCAT 

21541 TAAACAGAGT GAGACCGCCA AATCATAAAA CTGATAATAA ATAGCGGACA ACGTTCCACG 

21601 GAGCCAGTTG TATAGCGCTG CATTACTGAA TTTACTTTGC AGAAAGGCTA ACTGCGCCTG 

21661 AGTTTGTGCC TGCTGAGTTT CCAGATAGTT TTTTTGTAAT ACTGCCGCTT CACGACGTAC 

21721 AGCCAGCGTC GCTAATTGAG CATCAATTTG TTTTATCTCA GCTTCCGCAT TATTGCGCTG 

21781 AATTTCCCAC TCTTGCCGAC GGCGACGGTA TATTTCTGAT TGGCTGATTT TGTCTGCGGC 

21841 AATACGTGTT GCTGACGCAG AAATTTCGAT ACCAATCGCA CTGGCATTGA AAAGCGCCCC 

21901 AAAACGGGAA CCTCCCACAG CAAAACCGTA AATATTGGGG ACGAGATCTG CCGCGGCGGC 

21961 GGCCATATGC AGGGCTGTGC CGCTGGTGCT CAAGACCGAT GAAGAGAGGT AAAGATCCAT 

22021 CGCTTGTTTT TCACCAGCGT TAACATCTTC GTCGTACAGC GTATTGAAAC TGTCAAAACG 

22081 AGACTGTGCA CCATGACGGC TTTCTTGAAG CGCCAATTTA TCAGCATCAA TTTCAGCCAT 

22141 GACCTTATCC TGCATTTTAA TACTTTGCAG GGCTAACTCA CTGCCTTGAG TTTGCAGTAT 

22201 TTCAGCCAAG GCTTCTGCAT CCTGCCGTTC AGTAATGCTG AGCAGGGTAT TGCCAAATTG 

22261 TATCAACTGG CTTACCCCCC ACTTGGCATT TTCCAGAATC ACCGGAAAAC GGTACATCGG 

22321 CATCACTGCA TGAGGTAAAT CGCCGCCGCC TTGTGAAGCA GTGATGGCAG CACTGAGTAA 

22331 CATGGACGGA TCTGCGGGCG TGGCATAGAG AGATAATGAC AGTGGCTGAC CGTCGATTGT 

22441 CAGGTTATGG CGTAAGTTAT AGAGGCGTTG CGTCAATGTC TGCCAGTAAC CTTGCAGTTT 

22501 TTTATTAATT TGAGGGAGGA ACAATGCGGT TAACGAAATT TGCCGTACGT TTCGTGGGTA 

22561 ATGCAGCGCG CTGACGCAGT TGCAGCATTT TATGTTGATA ATGATGCCGC ATTGTTTGGC 

22621 TGGCAGCTTC TTCCAGCCGT GGCTCTGACC AATCGTTATC CAATGAAAAA TAAGGCTCAT 

226 c 1 CACCCAATAA AGTGAGCGCC TGTACATACC ACATTTTAGC TTCGTTTAAG GTATCACGTT 
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22741 CAAGCTGGCG ATAGGCGCTA 

22B01 TAGCCACTTT ATAGTGCATC 

22B61 GCGGATTCGC ATTCCAGGAC 

22921 GCATTTCACC CTGAAC CGAA 

22981 TCCGGTAAAA TCTGfCTCTTG 

23041 CAATCCCAAG AAATAGATTG 

23101 TCATGACACG ACTTGAATAC 

23161 TGTCATTCCC GAATCATGAT 

23221 AA AATAA AAG AAAGCAGATT 

23281 TAATCTGACA CCTTCACGTA 

23341 GGTTTCAATA TCAGATAACA 

23401 ATTCCCAATA TGGATCTTAA 

23461 CAACTTAAAT GTCGCATAAA 

23521 CATACTTAAA ACATTATCAA 

23581 AGTATTTAAC CGGGTTCTGT 

23641 ATGCAGTGAT ATAACGTTAC 

23701 AATAACAATA TCGTTATAAC 

23761 AATATCATAG CCTTCAATAT 

23821 TTCGTGTGAA GGTTTAGATG 

23881 GCCATTTTTT AATAAAAAAC 

23941 AAGTTTTATT ATCTGTGGCT 

24001 AGTGCCGCAA TATTTCCCAT 

24061 ATAGTATGCC ATACTCCTGT 

24121 AATTCAGCAT CATCTGATTC 

24181 QGA ATTTTTT CCTTGGTTCT 

24241 TCGATGGAAA TAATAAAATC 

24301 ATTTCATCAT GTCTATCATA 

24361 AGGTATTTAA CCTCATCATT 

24421 TTGAACGTTA AATTAATATG 

24481 CCGGTAAAAC TGGCTAATIT 

24541 TCTGTAGCAA ATTGATTGTT 

24601 CTCTTAACAA TGGCGTCTAA 

24661 ATG3TCATCC CTTCTCTTGC 

24721 AAATAAACAA TAATGACGTC 

24781 TTTTTATTGA TCAGGTTTTC 

24841 CTGCCA TCAT AACGAATATG 

24901 TCTTCACTTT CATTTTTCAG 

24961 TAAATAACAG GTCTGATATT 

25021 TCTCCCCAGG CATTGGCAGC 

25081 CAATAATATA TTCTGGGTTC 

25141 ACATTAATAC TGTCATGATA 

25201 GT CAGAT AAT T TTTAAAG CT 

25261 TGCTGTAACA GGTTGTTCAT 

25321 TTTTCCGGAT AATAGGCCAG 

25381 GTTTCCCAGT CGCAGAAGAA 

25441 TTATTCAACG CCCGGTTGAC 

25501 GTTTTCACTT GGGCAGAAAC 

25561 CTCTTAATCT GTTGAGGTGC 

25621 GCTTCATCCA GCCATGCCTG 

25681 A AAGTAC TGG CGGCTTGCCA 

25741 TATTTTAATT TTATGAGTGC 

25801 TCCAGCCATT GCAGAGTGAC 

25861 ACCGGTCTGT TGCAATGCTT 

25921 AATTGTTCGG CAGTCAACGC 

25981 TCACAACGCA TGATCACAGC 

26041 TGCAGTGCTG TGGTTTCTGA 

26101 TTTTCGCTGA GTCCAATATT 

26161 GCCACCATGT TGCTGGTTTC 

26221 TCATCGAATG TCAGTCCTTG 

262B1 GTTTTGGCTG AATCCATTTG 

26341 TCGTCATCAC CGAGTGAAAG 

26401 ATATCCGGCG TAAGGACAGT 

26461 GACCATTTCT GTGTTGTCAG 

26521 AATGCTGTAT CAGAAAAAAG 
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TCTCCGCGGG 
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GGCAATTTTC 



TAATCAACAA 
GGGCAACGGC 
CCAATGGGCG 
GGTTCAGATA 
TGGAATACCA 
GTTTGAAATC 
TTTTTTGATA 
AGTGAATATA 
GTCATAGATA 
TTTAGCATAG 
ATAAGGTTGT 
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TAATTGCGGC 
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T7AAAAGAAT 
TCAGAAGAAC 
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AATATATAAT 
TCCAGCCACA 
ACATTGATGG 
TGCTGGCACT 
TAACCAATTA 
ACCTGCAAGT 
GTATCGATAT 
TGACCAATAC 
CCGGCCCAGG 
TTCACTGGCT 
ATGCTCGCAA 
ATCAGCAGAT 
ATGTAGTAAG 
GATTGTTGAC 
GTTGGCATCG 
TCCGGGGTAA 
TCTCCAGTTG 
CTGAGCATCA 
CAAATGCTGT 
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TCCGGTTTTG 
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CGATCACGGT 
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GCAATCAGCG 
CCATTACTTA 
TCCGTGGTCA 
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GTGTTCACAT 



ATCCAGCATT 
GTCCGGATCG 
GACGTTCCAG 
TAGCGCAGCC 
TCATGGGCGT 
CATGGGTTCA 
TTTTTTACTA 
AATTGATTTT 
ATTTTTTTGT 
GGAACAAAGA 
CTGGCAGAAT 
TCCTCTTTAT 
TCTGGTAAAT 
GCTAATTTTC 
TGTGTGCCTG 
TTTATATGAA 
ATAAACTCGC 
TCATATACCA 
AACTCCACTA 
TGTTCGATCA 
ATGGATCCTC 
CATTAGTAAA 
CTATGTTAGT 
CTCCAATTTC 
AATATTCGGC 
CTTCTTCTTC 
CATCTTCTAC 
AATTAATTTC 
ACGAGAGATC 
CCTTATATTC 
TATCAAGTTC 
ATAATGTCAT 
AATTGTCTTT 
AATACATACC 
AATTAAACGG 
CAGTGATATC 
GTAAATACAA 
GTATTTCAAT 
TTTGGTGATC 
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TAGCGACATC 
TTAACTGACT 
GAATCGTGGG 
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GCAAAGTGGG 
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GCCCCAGTAC 
TAGCCGCAAT 
ACAGCAAAAT 
GGGCAGCTGC 
GTGTCGTGAT 
TCAGAAACAC 
GAAAGCTGAT 
AGGGAGAAAC 



TTCATAAAGG 

ACCGAATCCA 

TAATAATCCT 

AGCGTGTCGA 

TGTAATAGAA 

GTGTTATTTT 

TCCCCTGTTG 

TCGTCTCATC 

A CCCAA CCCC 

GCGTTACTGT 

TGCCATCAAT 

TGTAGGGGGG 

TTTGCGTTTC 

TCGAAAATAA 

ATGGCATTTT 

TTGGCGATTC 

TCACCAGAGG 

TAGGGTCAGA 

GAATATCAGA 

TAGATGAAGC 

GCGAAGGAAC 

TGATTCACAT 

ATCAAGTTTG 

TGATTTTCTA 

ATTTTTAAGA 

TTTTCCAAGC 

CATCGATAAC 

CATTGAAGGA 

AAAAATATTT 

GGCCAAATAA 

TGATAATGTG 

ATCAGGGTTA 

TTTCTCATGG 

AATGCTGGCT 

TGAGCTCCAG 

TATCTTGCCA 

ACGAGACTTG 

TTTTTTCCAT 

GACATTGCGC 

CCCCTCATTG 

TTCAAATGCG 

TTGGGAAAG7 

GTCGATATAG 

TCGATTGTAG 

TCCTTCAACA 

CACACGGGTG 

CTCATCCCGG 



CGCCTGCAAC 
TTCACCGACA 
AGCAGCGACA 
TCACTCCCAA 
GCCACCGCCA 
TCGCGTAGCT 
GAGATCATGT 
GGTCAGTTCG 
CTGACAAAAA 
AATCATGAAA 
AGTTTCTGCT 
ACGGATCAGT 
AAGGTTTTCA 
ATCACTGACA 
TAATTGCGTT 
CGACAACAAC 
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ATGGATAATT 
AGAGACAGGT 
CATAATGTAT 
AAATGGTTTT 
TGGCCATTTA 
GAATATTGGC 
TGCTGCATAT 
TCATCATAAA 
TACAAACGAA 
TAAAGTGTGC 
TGTTATACCG 
AGCAATGGGG 
AAAA TCCAAG 
TTTCATTGGA 
CTTAATCAAT 
ATTCTTATAC 
TTCATCATAA 
ATCATCTGTA 
TACAAGGATT 
GTCCATTAAT 
TAAAATTGCC 
GTAAGTGGAC 
ATGCAGCAGT 
TGTCAGTGCC 
AGGATGAAAA 
GGAAGCAATT 
ACTATTGTTT 
AGCACGGGAC 
TTGATATAAA 
GCTGAAAGAC 
ATCTTATCTA 
TGATTAATGG 
ATTGATACTG 
ATGGATTTAA 
AAATTCAACT 
ATGACTAATA 
GCTACGTTGG 
CAAAAGTTGG 
AACCTCGCAA 
TGATTACTCT 
TTTCCCCGGA 
CCAGCGTTGC 
TTGTTAAACA 
AGGGATCGCT 
AAAACAGAAT 
ATGGGTCTGA 
TGGAACAGTG 
GAATATAGCC 
TGACGAGGAT 
AAGTTAAAAA 
ACGT7GA0GG 
CCTTTAAACT 
ATAACAATTT 
TACACCCATT 
TTTGACATGG 
ATGCAAGGAT 
GAGAAGATTA 
GCAATAGTTA 
ATCTAACAAT 
TTTCGTTTTC 
AACACGCTTT 
GTCCACCTCA 
TGATTTTTGC 
TTC3CCAAAC 
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ACGAAATTTT 

TGGTCAGGTT 

ATTAGCTCTG 

CTTGCCGTTG 

TGTTGATTTG 

GACAA ATCGT 

CCGAAATTTT 

TGATACAATT 

TCCGCTAOGT 

TGACGGGCTG 

AACATTTTCA 

AATTCATTAG 

A GTGAAG CAA 

TCTTTCGCTT 

GATCCCGGTT 

AGGGTTGACG 

AAAATGCGCA 

AAGTCTGTTT 

ATAAACGAAA 

ACAGTTCATT 

TTTTTAGGTG 

ATTTATCGCT 

ATAATGAATA 

CAACAATTGT 

TTTATCTATG 

AGTCAGATAA 

TATATCGTGA 

ATGCGGGGTT 

TATTCTCAGT 

CAGCACAGAC 

TCAAG CAAAA 

GGTTGGTCAC 

TATTACGGAC 

TCAGGGATAA 

CGGCCACTCC 

TTGAAATTGA 

COGCACTCGC 

CAAAACTAAG 

TCAAAACTTA 

ATGTAAATAT 

A7ATTCGGGG 

TGGTTGTGAA 

TTTCTCATCC 

TTAAGCAACT 

TGCCATAGAC 

AATTTGGGAT 

TCTAAAATTA 

TTGATAAATA 

GCGTTGAGGC 

TGTTGCCCTT 

TCCTCATAGA 

CATTTTTTCA 

GATGCCCGTC 



CAGATGATGA ATGTCTGCCA GCAGACGAAC GCGATAAAGC 
ACACATAAAT TCTGGATTTG TTCOGCCATT AGCCAGTTTC 
ATCATTCACT CTGAAAGCAC GTTTCATTAT TCCCAAATAA 
GGGGGTTAAA TCCAGTTTGG TATTATCAGC AGAAAACTCT 
ATTGAACAGC ATTGTAAAAT GACTGGGTTG TTGTTTAGTG 
ATGACACAAT ACCAGCGCAT CGCTGACGCT AATATTATAG 
ATAAAACAGC TTACCCAACA CATTGCTGTC AATGGTTAAG 
TAC TTGC CAG ATATCTTCTG GAGATATGCC TGTGGCTTTA 
CAGCTTTAAC AGGAATATAT CACCGGGAAC TCCATCATTT 
ATAGCATCCG ACGGATTTGG TTAACTCGCC ATAAGCGGAG 
GCTCTCTCGT CAATTTAATG GGAATACTGT AATGGGTATT 
TATCTTGGTA TATATATTCT TTATCTCCAT TCTGGAGACG 
CTS TTTTT TT TACACTCAAA TTATATTTGT ATTCATTTTC 
CATAGTTTAA ATGTGAATCG TAGAAATCTT TGCGGGTTCG 
CCGTATCATT CCCGTCATTG ACCAATGTTA TCAGTTGCTC 
TATTTTTCTT ACCGAAGGAG AGATTGACAA ATAAACTGAG 
AGTAGCGAGC CAAAGAAGCA TAACTCTTAA AAATCAGTAC 
TCTTCATCAG TTCTGTTGAA TTTTCCGGTG TAATTTCTTC 
CAGGCGATAT ATCAGTCTTA ATAGCCAGTA GCGAT G TT G G 
CTGTATTACG GCTAAATGCG GTGAGGTTTT TATCTTGCAA 
ACTCATACGG CAGATGATAG GGTGTCATGC CGGTTTGCCG 
TTACACCGTT ATAGTCAGTT TTCTCTAACG TCTGAATATT 
ATAAGGATAA TGTGGAAATT TCTTCATCCA TATTATTCTG 
TGTCGGGGCG TCGTTTATTC AGGTGATATT GAGAATTGTC 
CCCGATATAA TTCTGTTAAA TAAGCCGCTG GTGAAAATAT 
TTACAAAACG GTG GGCG CGG CCATAAAACC AACTGTTGTA 
GTGTAATATT AAGGTTAGTG ATATTAGCCA GTTGTGGATT 
GTTCTTCAAG TTTATTCTGT TTTGATTCCT GATGAGCCTG 
CTCGCCACGT CAGAGTTCCA CTTGTCCTAT GACGAAATTC 
TGTTTGTCAA TAATAAAGTA TCACCAGCCT TTTTCTATTT 
AACTTTTATC ATATAAATCC TTAAGTTATT GTCAATTTAA 
GAGATTATTA TAATCTGATA GGAATATTAT GGTTAATTAA 
CTATTCTTTC AATAAAAAAT AAAGAACTTC CCTATAATAC 
CCGTATGTTA AAAATTAAAT TTTAACAAAC TTTCATGAAA 
TTAAATATTT TTAATTGTGT TTGTGCTGTT TGAAAAATGA 
AAAGATTATT TATTGAGGAT GTCTTGCTTG GTTTCAGGGG 
ATGTGTGCAA AAAGAAATCC TTAATAAAGT TGCGTAATTA 
CAAGAGTGAT AGTAATGTCA CATAATTTAT TGAATACCCG 
TTTCTTCGCA TAATCAAAGA GAAAGCTATG AAAAAAACAC 
ACCCTTTCTT TTGGTGCTTT GGCACAGCAG GGTGGCTTCG 
TATACTCAGG GTGGATTTAA AGGTCCAACT CCCAACCTGA 
TCTTTTCGTG ATGATGCGTG GGTTGTTCTG GAAGGAAACA 
GAACTCTATG AATTCGCGGC CGCATAATAC GACTCACTAT 
TTATCCGGAA AGCT ATCTGG AACCCCTGTT ACXJCCTGAAT 
CAGTGGTTCT GTTTATGTTG ACATTGATGA TAAGCGCTGG 
AACTGACAAA GTTCGTATCG AAGGTGAAGT GGACAAAGAC 
TGT CAAAACT ATCCGCATAG TGAAATAACT CAAGCACTTT 
GGGGTTTTTT GCTTTCTGGG AGTCGGAAGT TTAACCGTAG 
TTAACGGCAG TGGTCACTGA TTTGGTGCAT AAGTTATCAA 
TTTTTTATTT AATAGAGGAA TGTCACCCTG TAGGTGAATA 
ACAGTATTAT AGTCCTTTGA TATGTTATTA AATTGAAAAA 
GAAATTATTA TGTCAGATGT TCGTAATATT ATTAATGTTG 
TATAAAGCGG ATTTATTTAA ATAAGTTTTC ATAATTGTGA 
CCGGTTTTTG CTGTTGTAAG GAAGCGGTTT CCATGAAGAT 



GCCACATAAA TTGGCAGCAG 



TCACGGTTTC 



GTTCAATTTT ATTCAACCAC GGGCAATAGG TCGGTAAAAA 
TCTTTGCCAG CCAA ACCCTG ACCTTCCGGC TCTTATGAAT 
ACGTGATGGT TTTGGCATTA ACATATTGAT TGTTAATTTC 



AATCTGAGTT CTTTCTCAAG 
AATT5GCAAG GTAGTGTTTT 
TGAAGCACCA GTCTGCACCG 
AGACCGGGTG TTTCTCTTGA 



CTACCGACAT AAGTGATTTC 
TGGTTCTTTC CZGGGGGTAAC 
ATTTTCGGGT TCAGGTTGAT 
GGCATTCGAT AACGTCTCGC 



TCATACTCAG GGTCAGGCAA TTTTACGGTT GGTGCCGCCC 
CGGCAAAAGT AGCGATAGAG GGTACTTTGA GAGAGCGATG 
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TATTCAGTAG 
AGCCAAAATG 
AGTTCCAGAT 
TTAACCAATT 
GAGAAACCGT 
TATCCCGGGT 
TTATGATTGG 
CGCGAAAATC 
GGAGTCAGCA 
TGATTCATCT 
TGGCCATTAT 
GTTTCAACAT 
AATTATGTCT 
CGGGAACCTT 
AATTAAGTTT 
TTTATGGTTT 
TTATATAGTA 
TAACAGAAAA 
GAGCTGTATT 
GTCAGCAGAT 
AGTAAAGGCG 
AGCCTCACCA 
TTTTATCCCT 
GGAACCGGCA 
GTCGAGACAA 
GATCCGTTAA 
AAAAAAGGAC 
AGGCCTTACC 
CACCTGTATA 
CTGACACATA 
ATGCTGGAGT 
CAGCGCCATG 
TTTGTCCATC 
GAACAGCTTG 
GCTGAAGGAC 
AGTCTGGACA 
CATTAAATGG 
ATTTTATTTA 
TATATAACGG 
CGTGAGCAGC 
CCAATTTCAT 
AA ATCCAG TG 
CCAGTTCAAA 
GCAGACAGAA 
TATAGCGGCG 
GGTGAATTTT 
AGTAATCCAT 
TGGGGGGATA 
CGTAGGTCAT 
CCATTTTGGC 
AGGCGATGTT 
GTTCCGGGTC 
CCACCGCAGT 
GCACGTTGCT 
ATACTGAATG 
AATCCCCCCA 
CTGGCATCAC 
TAAG GATC AA 
TCTGCTTGCC 
TCACCGTTGA 
TCCAGATCAA 
TCTGGCGACA 
AGCGGTTCTG 
GCTGAAGAAT 



CTCATTGATT 

TTGTGGCGAG 

GGCAGGCCTT 

TACCCAACGA 

ACTCCCTTCA 

TTTCTGGATA 

CATGACTCAG 

GGACTGAGTT 

AATGAGTTAT 

ACCGGTGGTA 

ATCAAAGTTA 

GGCAGTTATG 

GGTGATTCAG 

CCACTGATGG 

ATATTTCATC 

TATATTTAAT 

AATAAATTCT 

TTCATGGTTA 

TACTGTAGAA 

AATATGTTGT 

ATTAA TAAC C 

TGACGCGTTA 

T TATCT GCCG 

GCTTTGTGGA 

CCCACGGGGA 

TGGCCTGGCG 

ATACTGAACT 

CTTACTCAAA 

ATCAGCCCCT 

AAAGCATTGC 

GGGTTCCCCA 

TTGTGTTAAG 

AACTGACTGA 

AACAAAAAGG 

GGGAAGAAGG 

TCATTGTCAC 

ATACGCTTTT 

CTACGATTTA 

TCCCATATCA 

ATTTGCCAAC 

TTTGGTTGCA 

ACCACCGTCA 

CTGATTTTTC 

GAATTCCGGA 

CGTTGTCAGA 

CTCCGGTTGT 

GTCCCGATCA 

CAGGTTAGTA 

CACAAAGATA 

AACGACGGCG 

CAGTGCTTCA 

GAATTCATTA 

AAACAT GGGA 

CAGGATCTTT 

CGAGTTCCAG 

GTAAACCGGA 

GGCGCTGATC 

CGGGTACAAT 

GGTTCCACCC 

CGGCCATAAA 

AACCACGGCC 

AA CGCGC ATC 

TATTTTCCGG 

AACTCAAAGG 
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TTAAGTGTAA 

TGCTGTAATA 

CCCGCCGGGA 

TGAACGGAAG 

T GTAACAT CA 

GCTTTTTTCA 

TCCATTTTGG 

CCCTTCAAGT 

TCCCCATAAT 

TGTGGATTCC 

CTTTCAGTAA 

TTTATTTTAC 

CTAAAGGCAA 

TATTGAATAA 

TGGTTTCTGC 

GCCAATCATA 

GTTGGATGTG 

GGAAATTCAA 

CTCGCATTGA 

ATATTGGCTG 

GATAAAACAG 

TTCAAACATT 

GAAGCGATCC 

CAGGCAATTA 

CGGTTACATT 

GCTGATGTAT 

CCCTTTGGTC 

TCGATGGCTG 

GCCGTTGGTG 

CTTGATGGAG 

ATTGGTGGCG 

CTATATTTTA 

ACAATCTCCG 

GCGTGAGCAA 

CAAGCTGGAA 

CAGTACCGGC 

TCACAGCAGG 

CGACGGGTTA 

ATCTTCTCTT 

AGGCCATCAT 

TAAATTCCCT 

GCATTAAAGA 

CCGCGTGCAA 

GCACCTTTTT 

TCAGCACCCA 

ACACCTTGTG 

GGATTGGGCG 

TGGTGACCGA 

TTGTCTAAAT 

CTACAGGCTA 

CGCAGCTCTT 

CCTTCTTCAC 

AAACGCCGGG 

GGCCATCACA 

CTTATGCCCT 

GGCTGCATCC 

CGCGTCCAGA 

ATGGCCTAAT 

GTCAACAACC 

ACTGAAAATC 

GGGGGCATCG 

ATACTGGCAC 

ATCAACTTCA 

AGTTCCGCTG 



TAAGCTCAAG GCTCCATCGT GAACGGAGAT 
AG AAAGAA AT GA CTGTGA AG AGCGGAGCTA 
GGCTTTTAAG TCCTTCCAAC CCGTATAATG 
AACGTGAACA GTGAAGCGTT CTCGAAAOGT 
AGAGCGCGGT GAA GOSACGT GCATAGTCCT 
TCGGACGTCG TTCATTTCGG GGTATTGATG 
GATTTGTTTT GATTTGGOGA TTAATCAGAT 
GATCTACTAT TTTGAAATCT TATTTAATCA 
ACCTGACCAT GTGGTTGTTT ATCCGGGAAA 
TTGGTGCGAT AGTCAGAAAG ATATTGACTC 
AAAGGACGCT GCTGATATTG TGAACTACAT 
AGACAGTAGT AAACAATTTA GCAATAAGCA 
AGGGGATTAT AAGCTTGAAA TTAAAACAAA 
ATATTG ATTC ATTATTATTT ATGGATAAGA 
AATTAAGTTT TAAAAATTAA TTCTACTTTT 
TTATTTTTCT TATAATAATT GATAGTTTAT 
ATTATTATTG TGAGACGGTA ATAATTAACA 
TCAACTTTTG TCOGGTTTCC TGACCATGAA 
TACT GGATT G ATTAGCCGGA CGA GT G TT G G 
TGGATTTTTC AGCGAGATGA TAGCTTTGGC 
AGAGACGGAT TGTGGCCAGG AAAGCAAAAA 
TTTTAACCCA ACCAGAAACC GCCCGGGAAT 
GGTCAGTGTG TGATTTACCA CACTAAAACT 
CGTCAGTTGC ACAGTGATGT GCTGTATTCT 
TATTGCCTGA TTGAACACCA GTCCACGCCT 
TATTCGCTGT CAGCCATGGC TGCGCATCTG 
GTCCCCCTGC TGTTTTATCA TGGTGAGGTG 
GATTGTTTTA CACTCTCTGA ACACGCGGCT 
GATATCAGTG CGCTCAGTGA TGAAGAGATC 
CTGGTACAAA AACATATCCG TTGCCGGGAT 
TTG7TGAA7G CCGGTTATAA TAGCGCCGAA 
CTGAATGGAC ATACGCTGGA TCTCGCCCAG 
GAGCATGAAA CCATGTTGAT GACTATTGCA 
GGCCGGACAG AAGGCAGAAC AGAAGGCAGA 
ACGGCGCGCG CATTATTACG GCATGGTGTC 
CTGAGCCGGG AGAAAATTGA AGCGTTAAAG 
A7AT GGTGAC CCCTGTGAGG CCAC CGGAA A 
CTTTAGGAAG CTGAATGAGA CGTCCTTTGT 
TTCCGCGTAC AGGTAAGTAA CCCAAACCTT 
CCTGATCGCC TGACCAAGAG AAGATCCCGC 
TATGCAGCAC AGTG CGGGG C GTATCCAGTG 
GTGCGTCAGC GTCGGTTTCC GTGTCTGTCA 
TTTCATATTC CGCATCGTAT TGGTTATTCA 
CCATCGTGCC CAGTGGCTCT CCTGTTCTGT 
GACATGAACG TCCATAGTTA GCAAATCCGA 
ACAGTAAAAA GCGGATCGCC TCATCTGCCG 
GAGGAGGGTT ATCGCCGTCA TATTCATATC 
TGTATTCTGC CCAACCGGTA CCAAAGAAGT 
AAGGTGCGAT TTCTTTGAAG CTGGACTTCT 
TCGTGATTTC TTTACGGGCC CGGGTTCCAA 
TCACTAACAA AACATAGTTT GGGCCATCAT 
CTGTGGCGCC GGGGTATTCC CAGTCGATAT 
AAGAAGTCGA CGATGCTACT CACAAATGTA 
GAGAAATACC CTGACA TACT CCAGCCGCCG 
GCCTGTTTTG CTCGCGCTTT CAGATTACGC 
TGATTGTAAT ATTGCAAGAA ATTCTTCGGG 
CCGACATTGC GTGTGGTGCC TAAATCACCA 
GTAATAGGGG CAATCTGGCC ACTGCTGGCT 
TCATTAATCC GTTCGGATAA CTTGCCTTTG 
AGGCGGTCGT AGGCGGTAGG CGGGATTTTT 
TCGCTGGTCA GCGCA G TGTT ATCCTGGGTT 
CAGTCAGTAA TATAGGCAGA GACTTTAGGC 
TATTCGTTGT ACAGGGACTT GGCAACACGT 
CCGTCAGGTT TATATCCCAC CTTCTGATAG 
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GTTTCTTCTG 
GGCGTATTGG 
CAGGCATTCT 
TCAACCCAGC 
CCAGATTGAT 
GCTTTAATTG 
TATTCCGGGG 
AGACGGCTAC 
TAACGCAGAT 
AATTCCAACT 
TCAAAATTGA 
TTAGCCTGTA 
TAATCGATAG 
GCCCGTACAT 
CGCCGATCAG 
AATG AACCCG 
ATGGTTTTTA 
TGTAAGGCTA 
CCACTTTGGA 
AGTAAACAGA 
TCATCCACAT 
CCGAAGCATA 
TCCGCCACAG 
GATTGTAATG 
TAGTGTTGCT 
TGTTGCATGT 
TGCTGCATAC 
GTGCTGCCGT 
AAACGATAAT 
TGGTAACGCT 
AACTCTTTAC 
TGCCACAGTT 
GAGACGGATG 
TCCTGTGCCC 
CAGTTTGCCG 
ATCTGCTTGG 
TCTAACAGAT 
GCTGCAATGG 
TCAA TAAT TG 
TCATCTTCCA 
TCTATATTCA 
ATCCCATTTG 
GTAATAAGCT 
AGTTTTTTTG 
GCGGGGAAGG 
TGCATACCGA 
GGCTTAAAGA 
GGATAGGTTA 
TTATCGTGCA 
TTAAAGTCAG 
ATCAGTTTGA 
TAGGAAAGCG 
TGCCTTAAGG 

TCGCAGACAT 
TGAAGACGGA 
ATCACAGCAA 
TACTCCAGCT 
AGGAACCCAA 
ATCTTTAGTT 
GAAAATGGGG 
CTCGCTCCAG 
TGAACGCCAG 
GCGTTTGATG 



TGAGTGCATC 
GGTTACCGTG 
CATAAACCGG 
CGATGTTTTT 
TCTGCCAGGC 
AAGGGTATCG 
CCGGCTCCTG 
GAACCTGCTG 
GGAG GATAA T 
TCCACTCTGA 
GCCTGAACAT 
CGTTCAGCAA 
TTTTTAAGTC 
CTTCATAAGG 
CGTGGGTCGG 
TACTCTTGCC 
TGATCTCCAG 
ATTTCAGATG 
CGAAATTCAT 
GGGAATTAAC 
GTATAACGCA 
CCGCCAAGAC 
CGGCCGCAGT 
CGATAACTTC 
GGCG CTCCTG 
CAATCGCTTG 
GGAAGTGTTC 
ACTGAATCAG 
TCGGGATTGC 
GCCTGAGCAG 
CATCCAAGGT 
TGAGATATTC 
TATCATATGT 
GTGCTTCACT 
CGAGCAGAAT 
CGACAGTACG 
TCTTGACATA 
CATCCGGATC 
GCCGTGAATT 
GCGGATTAAA 
GTCGCCACGC 
CGCCATTTAA 
GTGTATTCCA 
CAAACAGTGT 
AAAGGAATTG 
TTCTGAACTC 
GAAGTTCGGC 
AATCAAGAAC 
TCAGCCGGAA 
AT TGAGCGAC 
TACTGTCTTT 
TTCTTCCCGG 
CATCCTTGTG 
CATCAGCATA 
AACCGAAGGC 
CAAACGGAAC 
CCATCTGGGC 
GCCATCATAT 
TCACC CGCAC 
TCAGACTGTT 
CGTATCAAAT 
GCATTGG6AG 
TAATATTGGT 
TTAACACCAT 
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ATATTGCAAT 
ATCGGCAATT 
TAAATCAGGT 
AAAAACCGCG 
AACCAGAGAT 
ATAAACATTT 
ATATCAGTTA 
GCTGAACTCA 
ATCGCTCAGC 
AATGGCGCCT 
GCOGCTGTCT 

aacgttttcg 

AGCAGTACTG 
CCCCAGCAAT 
ATAATCGCGC 
AATTTCCCAC 
ACGTCTGGTG 
TTCTCCGACC 
ATCATACTGA 
CGCATCATAG 
TCATTGGTAN 
CATCCCCCCG 
GGCGGCTGAC 
CTGCTCGGTG 
AGCGGCCCGT 
CTGTTGCAGA 
AAAATCGGTA 
CGTTTCTGCG 
CCGGCGTTTC 
ATCTTGCGGG 
CAGGTTATGA 
CGTATCAACA 
CACAGGCAGA 
GTTTTCATAC 
ATCAGGGCTG 
TGCACTGGAT 
GAAACGGAAT 
GGTTGGTTCA 
CCAGTAACGC 
AATATAGTGC 
GACCAGAAAT 
ATCAATCGGC 
GCTCAGTACC 
ATTAAGGCGA 
CACCTGATCC 
TTGAGTACAG 
TTTCAGGGTG 
TTTTTCGCTC 
AGAACCGTTG 
AATCTCCAGT 
GCCGAAATCA 
GTTGCCGGAT 
TTCACGTGAG 
TTGGTCATCC 
TTCGTCATAA 
CAGAGCCAGA 
CATCCGGTAT 
TTGGCATAAG 
TAGGCTCAAC 
CTTCAACTTC 
TGGGGTCTAC 
ATAACGCATC 
ATGGCTGTGT 
CTTCATAACC 



ACCTCGGTTT 
TCTTCOGGTG 
GAAATATTGC 
CTATCATAAA 
GCGCCTACTT 
TGAGACATAA 
GAATTGTCTT 
TTACTTCCGC 
GACTCCAGCA 
GTCCCTTCAA 
TCCATGGCCG 

TAAAGCGTAT 
GCGGGCAATG 
AAGA ACATTT 
TGTGATGATG 
TTATGTTGCA 
AGCAGCCCCT 
CCTGTTTCGT 
GCTTGCAGGT 
ANTTGTTCNN 
ACGGCCAGAC 
TGGGCAGCGA 
ATGGAGATGT 
CGGCTGATGG 
TTGCGGGTAA 
TTGTCTTTTT 
GCCTCTTTTG 
ATGCCCGCCA 
CTGATGGGTT 
CGTAAGTTAT 
GGTTTGACAA 
AGTGGCACGT 
AGAGCCACAT 
GTACCCAGTA 
GTCAGCTTAC 
ATTGCTTTCC 
ATTAACATCC 
GGTTTACCTT 
AGCCATTCGG 
GGCATATGGA 
GTAGGGAATG 
TGCGGGATAC 
ATGTTTTGTG 
TGTTCATTGA 
CTGGCACTTT 
ATTCGATTAT 
AGTACCAGTG 
TAATATTGAT 
GTGTCATCAG 
GGGTTCATTC 
AGAGCACCAT 
TAATACCAGA 
GGCAAATCAG 
TCATAATCCT 
AAOGGGTTAT 
TG CAGATG TC 
CGATTTTGAT 
GTTTTGGTTA 
CGTCCAGGCA 
GCTGCCCAAT 
GGTATCAGGA 
ACGGGTACGT 
TGCGATAACT 



TTTCTCCOGG 
TCGCCTCACG 
GGTCGGGAAT 
TGACATACCA 
CGCTGCTGGC 

tttcacttcc 

GTTTTAATTG 
CACTCACATC 
GCTGATCCTG 
AAGGCAGGAA 
TTCAAATCAC 
ATTCCAAGGG 
TGCTGAGTTG 
ACAGCGCTAC 
CGGCGCTCAG 
TCAGTAATGA 
AATACGCCTG 
GATAAAGATC 
ACTGCCAGGA 
AAAGCCGGAG 
NNNNNNNNNN 
CGAAAATATT 
TCACACCTTC 
TTTCATCATA 
TGAGTGCATC 
AGCTGTACAG 
TCTCCAGCAA 
CCCGGCTCAT 
TACGATTAGC 
CATCGTATAA 
ATAGACGCTG 
ATAAATCAGA 
TGCTGACAGT 
CTTGCAGCGT 
ACATATTGAC 
GGTATTCCAT 
GGTAGTGAAT 
GGTACACGGT 
GGTTGCTGGC 
TGGCCTCTTT 
AAAACAGTTC 
AACCGGGTAT 
CCTGACTGGC 
GCGCGTTATC 
GTTTAATCAG 
CATTGCCAAC 
CCGACCCCAG 
GTTGTTCATC 
GATCTTCTAT 
TGCCATGAAC 
CGGTTTGGAT 
AGTACGGTAA 
CCAGGTTGCC 
TAATTTCTAC 
TACCTTTCTT 
GCGGGTCTTG 
TTCGCGCAGA 
CCGGTCAGGA 
TGCAGTGATA 
ATATACAGGC 
GGCAGGTCAA 
TGGCGTATCG 
CCGACAAAGA 
TTCAGGTTAC 



CGGTACATCA 

GACATATTGC 

ATGCCAGCGT 

GGTTTGACCA 

GTCAGACATC 

GGCCCCGTTA 

ATGTTTATTC 

ACGCGCGGTA 

AT OGGAA CCG 

AAGTTCATCA 

CACACCTTGA 

GTTAAGCAAA 

T ACCAGTG AA 

GGTTTTTATA 

TAAG AAAGTG 

TTTTACCGAT 

ATCCATCCGT 

ATTCCAGAGA 

GGCTTCGGCC 

ATTTGGCTGA 

NNNNNNNNNC 

GGGAACCATA 

AGCCGCTCTT 

GAGCGATTTA 

CAATGAAGCC 

CCCCAGTTGC 

ACTCAGTAAC 

GATCGGGGTG 

CACAACACGC 

TCCGGCCGGA 

ATCCAACATT 

CGGTGCGGCA 

AAGCATTAAC 

ACGGGGTTGC 

GGAGTCATAG 

GTCTCCCTGA 

GGGTTCACTG 

GGGTGGAGGA 

CTGA ACAAG T 

TAATCGTTGT 

CCAGAAATAG 

AGGCTGTTCG 

AATGGCGATC 

AGTTTCATCT 

TTCGC GAATA 

ACCACCTTTG 

CTTGATTGAT 

CAAGACAGTA 

CGCACCAAAC 

AAAATTGACA 

TCTCCGGCAA 

TCGATAGGAT 

GACATATTTT 

CAGCAGTGTA 

ATCTGTCCCC 

CTGTATATCC 

ATGGTGGGTG 

ACGGTGTGGG 

ACGCAGTTGT 

GATTATTCAG 

TAGGTTTCCA 

AAAGATTCAG 

AGAACTTATC 

TGACATCTTC 
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Fig.2. 



3B101 AAAATTATTC AGATAACCGA GCACCGCTTC TTCTACAGAA TCT TOG GTAA TTTTTCCCTG 
38161 ATTAAGGGCA CTTTCCAGTT GGAAGAAGAA TTCTGTTTTA TTCAGGCGTA ACAGGGGTTC 
38221 CAGATAGCTT TCCGGATAAG TCCGTAATAA GCGATCCC 



N=unspecified base 
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